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Abstract 

Kudekar et al. proved that the belief-propagation (BP) performance for low-density parity check (LDPC) codes can be boosted 
up to the maximum-a-posteriori (MAP) performance by spatial coupling. In this paper, spatial coupling is applied to sparsely- 
spread code-division multiple-access (CDMA) systems to improve the performance of iterative multiuser detection based on BP. 
Two iterative receivers based on BP are considered: One receiver is based on exact BP and the other on an approximate BP 
with Gaussian approximation. The performance of the two BP receivers is evaluated via density evolution (DE) in the dense limit 
after taking the large-system limit, in which the number of users and the spreading factor tend to infinity while their ratio is kept 
constant. The two BP receivers are shown to achieve the same performance as each other in these limits. Furthermore, a heuristic 
continuum approximation for the obtained DE equations implies that the performance of the two BP receivers can be improved 
up to the performance achieved by the symbol-wise MAP detection, called individually-optimal detection, via spatial coupling. 
Numerical simulations show that spatial coupling can provide a significant improvement in bit error rate for finite-sized systems 
especially in the region of high system loads. 
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code-division multiple-access (CDMA) systems, sparse spreading, spatial coupling, threshold saturation, iterative multiuser de- 
tection, belief propagation, individually-optimal (IO) detection, large-system analysis, density evolution, continuum approximation. 



j> \ I. Introduction 

ODE-division multiple-access (CDMA) systems have been used in the air interface of third-generation (3G) mobile 
■ communication systems (TJ-O. In CDMA uplink, multiple users simultaneously communicate with one base station in 

If} \ the same frequency band. The base station is required to mitigate multiple-access interference (MAI) to detect the desired 



signal for each user. Multiuser detection (MUD) is a sophisticated method for mitigating MAI by utilizing the statistical 
properties of the MAI [4|. Two optimal receivers were proposed 10: One optimal receiver, called individually-optimal (IO) 
£N) 1 receiver, performs the symbol-wise maximum-a-posteriori (MAP) detection. The other optimal receiver, called jointly-optimal 
I | (JO) receiver, is based on block-wise MAP detection. Since the two receivers are infeasible in terms of the computational 
£>. ■ complexity for practical modulation schemes, the main issue in MUD is to construct a suboptimal scheme that can achieve a 
• good tradeoff between performance and complexity. 

As suboptimal MUD, linear receivers with low complexity, such as the decorrelator (DEC) [5| and the linear minimum 
| mean-squared error (LMMSE) receiver 0, [[7], were proposed. An idealized assumption for analyzing the performance of 
MUD is random spreading: All spreading sequences have independent and identically-distributed (i.i.d.) elements, whereas 
pseudo-random sequences are used in practice. The LMMSE receiver for randomly-spread CDMA systems can achieve nearly 
optimal performance for low system load in the large-system limit l8l- fl2l . where the number of users K and the spreading 
factor N tend to infinity while the system load (3 — K/N is kept constant. However, the performance of the LMMSE receiver 
degrades significantly for moderate-to-high system load, compared to the IO receiver. Thus, it is an important issue to construct 
a low-complexity scheme that can achieve nearly optimal performance for moderate-to-high system load. Note that the precise 
meaning of low or high will be noted shortly. 

A breakthrough is the use of iterative MUD based on belief propagation (BP) [13], [14|. Kabashima |15] has proposed a 
BP receiver with Gaussian approximation (GA). It was shown that the proposed receiver achieves nearly optimal performance 
for moderate system load in spite of the low complexity. As a method for guaranteeing the convergence of BP receivers, 
sparsely-spread CDMA systems, or sparse CDMA (SCDMA) systems, have been considered [16|-[19|. In SCDMA systems 
the spreading sequence used by each user is sparse: Only c chips out of N chips are non-zero, whereas the conventional 
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Fig. 1. Landscape of potential energy as a function of multiuser efficiency. The multiuser efficiency for the BP receiver after infinite iterations is represented 
by the large (respectively (resp.) small) balls for uncoupled (resp. spatially-coupled) SCDMA systems in the large-sparse-system-limit. 



CDMA system uses dense spreading sequences whose chips are all non-zero. The main advantage of SCDMA systems is that 
the convergence of BP-based receivers is guaranteed in the large-system limit with c fixed. Montanari and Tse [16| proved 
that the performance of an iterative receiver based on exact BP is equal to the performance of the (soft) IO receiver for the 
conventional (dense) CDMA system below a critical system load /3&p, which will be explained shortly, in the large-sparse- 
system limit, where the dense limit c — >• oo is taken after the large-system limit. Note that the system is sparse even in the 
dense limit since the large-system limit is taken first. Furthermore, a regular ensemble for sparse spreading sequences has 
been considered in ifTHl , |fl9l . The BP receiver for the regular ensemble can achieve nearly optimal performance for moderate 
system load. 

In order to present the aim of this paper, we shall explain the definition of the critical system load /3bp, called the BP 
threshold in this paper. Let us consider the multiuser efficiency (ME), which is a normalized signal-to-interference ratio (SIR), 
as a performance measure of MUD. The density-evolution (DE) equation for the BP receiver in the large-sparse-system limit, 
derived in [16|, |fl9l , characterizes the dynamics of the ME. The DE equation can be regarded as a discrete-time gradient 
system with a potential energy function. Time is analogous to the number of iterations. The potential energy as a function of 
the ME has the unique stable solution for small system load /3. As (3 increases across a critical system load, another stable 
solution emerges to the left side of the original stable solution. The BP threshold (3bp is defined as the supremum of (3th 
such that the BP receiver can achieve the ME corresponding to the rightmost stable solution after infinite iterations for all 
(3 E (0,/?th)- Note that the number of iterations tends to infinity after the large-sparse-system limit. The BP threshold (3bp 
is characterized as the bifurcation point at which the stability of the potential energy changes: The potential energy has one 
stable solution for f3 < (3bp, whereas it has two stable solutions and one unstable solution for (3 > /?bp- In fact, the ME for 
the BP receiver converges to the unique stable solution after infinite iterations for (3 < /3bp- For f3 > (3bp, on the other hand, 
the ME converges to the left stable solution after infinite iterations, since the initial ME is commonly a smaller value than 
the unstable solution (See the large balls in Fig. [TJ. Thus, the BP threshold (3bp is equal to the bifurcation point between the 
monostability and the bistability. 

The aim of this paper is to construct a novel SCDMA system that improves the BP threshold (3bp- For that purpose, we 
utilize a recent excellent achievement on coding theory: Kudekar et al. 11201 have proved that the BP threshold of a low-density 
parity-check (LDPC) convolutional code 11211 over the binary erasure channel (BEC) is equal to the MAP threshold of the 
corresponding LDPC block code (See also [22 1). Since an LDPC convolutional code can be regarded as a spatially-coupled 
(SC) chain of LDPC block codes, this result was referred to as "threshold saturation via spatial coupling" I120) . An improvement 
of the BP threshold via spatial coupling is believed to be a universal phenomenon [23|-[25|. The same phenomenon has been 
observed in many other problems [26 1 — [37 1 . In this paper, we propose spatially-coupled SCDMA (SC-SCDMA) systems to 
improve the BP threshold. 

We shall develop a simple, generic, and heuristic method for characterizing the position of the BP threshold for SC-SCDMA 

systems, instead of providing a rigorous proof. Recently, we have presented a phenomenological study on threshold improvement 

via spatial coupling |25|. The study allows us to specify a lower bound on the BP threshold for SC-SCDMA systems via the 

shape of the potential energy obtained from the DE equation: The BP threshold is above a system load /5 B ^P^(> /3bp) at which 

f so 

the heights of the potential energy at the two stable solutions coincide with each other. The ME for p < /3g P converges to 
the right stable solution after infinite iterations, whereas the ME for (3 > (3 B S p^ may be trapped in the other stable solution 
(See the small balls in Fig. [T). Thus, the BP receiver for SC-SCDMA systems can achieve better performance than that for the 

1 If the potential energy is monostable, the rightmost stable solution corresponds to the unique stable solution. 
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corresponding uncoupled SCDMA system in the large-sparse-system limit, when the system load is between Bbp and B BP ' . 

(SO 

The lower bound B BP will be shown to be equal to a critical threshold pio for the uncoupled CDMA system, called the IO 
threshold in this paper. 

The IO threshold Bio (> Bbp) has been specified via the large-system analysis of the IO receiver based on the non-rigorous 
replica method [ 10 1. The ME of the IO receiver for the uncoupled CDMA system is characterized via the same potential energy 
as in the DE equation for the BP receiver. The IO receiver can achieve the ME corresponding to the rightmost stable solution 
of the potential energy for B < Bio, whereas the ME achieved by the IO receiver corresponds to the left stable solution for 
B > Bio- In summary, the BP receiver for the uncoupled SCDMA system is inferior to the IO receiver for system loads between 
Bbp and (3io, since the definition of the BP threshold Bbp implies that the ME achieved by the BP receiver corresponds to 

the left stable solution of the potential energy for 8 > Bbp- On the other hand, the BP receiver for SC-SCDMA systems can 

(SO 

achieve the same performance as the IO receiver for the uncoupled CDMA system when 8 is smaller than B BP (= Bio). In 
this paper, small system load means that 8 is smaller than one. We refer to SCDMA systems as moderately loaded systems 
if 8 is between one and the conventional BP threshold Bbp- High system load means that 8 is between the conventional BP 
threshold Bbp and the IO threshold Bio- 

We have so far focused on the thresholds in the large-sparse- system limit. It is worth investigating what they indicate for 
the performance of finite-sized systems. The definition of the BP threshold /?bp implies that the asymptotic ME changes 
discontinuously at 8 — /3bp as /3 grows. What does this phenomenological picture indicate for finite-sized systems? The ME 
for the BP receiver never changes discontinuously for finite-sized systems. Rather, numerical simulations in lfl5l . |[T6l implied 
that the ME decreases rapidly like a waterfall when the system load moves from below to above the BP threshold Bbp- The 
slope of the ME as a function of 8 becomes steep around the critical point 8 — Bbp as the system size grows. Thus, the 
system size required for achieving an ME close to the asymptotic one increases as the system load gets closer to the BP 
threshold Bbp from below. In other words, the performance for a fixed finite- sized system gets away from the asymptotic one 
as the system load gets closer to the BP threshold. These arguments may indicate that increasing the BP threshold results in 
improving the performance for a fixed finite-sized system. Numerical simulations will show that spatial coupling can improve 
the performance of the BP receiver for a finite-sized system especially in the region of high system loads. 

We would like to refer to an independent work ll32l : Schlegel and Truhachev proposed another SC-SCDMA system based on 
graph lifting, while we consider sparse spreading [33 1 . Interestingly, the obtained DE equations are the same as those derived 
in this paper. They analyzed the BP threshold in the high SNR limit, whereas we investigate the position of the BP threshold 
for any SNR. 

The rest of this paper is organized as follows: After summarizing the notation used in this paper, we review our previous 
phenomenological study on spatial coupling in Section [TT1 along with several new results. In Section [HI] we first consider the 
conventional SCDMA system. After introducing its factor-graph representation, SC-SCDMA systems are defined on the basis 
of two operations with respect to the factor graph. In Section [IV] two BP-based iterative receivers are derived. One receiver is 
based on exact BP lfl6ll . lfl9l . and the other receiver is a BP receiver with GA [15 |. Section IVl presents the main results of this 
paper. The proofs of several theorems are summarized in the appendices. In Section [VI] the performance of the SC-SCDMA 
systems is investigated numerically. Section I VIII concludes this paper. 



A. Notation 

For a matrix A, A 1 denotes the transpose of A. In stands for the N x N identity matrix. The Kronecker delta is denoted 
by 5ij. For a natural number L and an integer I, is equal to I + kL for an integer k such that < / + kL < L — 1. 

For a random variable x, E[x] and Y[x] denote the mean and variance of x, respectively. The notation p(x) stands for the 
probability density function (pdf) of a continuous random variable x. We use the same notation p(x) for the probability mass 
function (pmf) of a discrete random variable x. The notation x ~ p(x) indicates that the pdf or pmf of a random variable x is 
equal to p(x). The real Gaussian pdf with a mean vector m and a covariance matrix X is denoted by J\f(m, X). In particular, 
the pdf for a zero-mean Gaussian random variable x with variance a 2 is written as 

3(x;a2) = 7ib exp (~^)- (1) 

For variables {a^ S Ai} on a finite alphabet A4, the sum J2ja k } f({ a k}) denotes the marginalization of a function f({ak}) over 
all possible configurations of {ctfc}. Furthermore, the sum z2\ ak stands for the marginalization over all possible configurations 
of {dk>} except for a^. For a conditional pdf or pmf p(x\y), p(x\y) oc f(x) means that p(x\y) is proportional to f(x), i.e. 
there is an ^-independent constant C(y) such that p(x\y) = C(y)f(x). 

Graphs with nodes specified by two indices i and j are considered for SC systems. Thus, the pair (i,j) represents not an 
edge but a node for the SC systems. Furthermore, d(i,j) stands for the neighborhood of the node i.e. the set of nodes 

that are directly connected to the node 
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II. Phenomenological Study on Spatial Coupling 

A. Review of Main Theorem 

We shall review our phenomenological study on spatial coupling [25 1. The study predicts the position of the BP threshold 
for SC systems. Furthermore, it helps us understand why spatial coupling improves the conventional BP threshold. We start 
with the following partial differential equation: 



du rrl , . 2 



i du\ 2 d ( .du 



(2) 



with B{u) > 0. In (O, t > and x £ [0, 1] denote the temporal and spatial variables, respectively. The variable t is analogous 
to the number of iterations. The state u(x, t) corresponds to a performance measure, such as SIR, ME, and so on. Without 
loss of generality, we assume that larger u implies better performance. The parameter 7 > 0, which will tend to zero in the 
following analysis, represents the strength of spatial coupling. The dynamics of the uncoupled system 7 = is characterized 
by a potential energy function V(u), which is assumed to be bounded below. The two functions A(u) and B(u) > are 
arbitrary smooth functions. 

The system (O can be regarded as a continuum approximation of coupled DE equations that characterize the BP performance 
for a one-dimensional chain of L systems. The same approximation was considered in ||24l . Let uij denote BP performance 
at position I (I = 0, . . . , L — 1) and in iteration i. Each state uij is assumed to be recursively given via the states : 
I' = 0, . . . , L — 1} in the previous iteration i — 1. The difference (uij — U;,i_i) is expected to be small except for first several 
iterations, so that one may approximate the difference by 

du , , 

ui,i ~ ui,i-i w — (x,t), (3) 
at 

with u(x,t) = m t i for x = l/L and t = i, which corresponds to the left-hand side (LHS) of (|2). Similarly, the last term on 
the right-hand side (RHS) of (O may be obtained from a second-order expansion of the spatial difference (ui' t i — ui.i), 

I'-ldu l(l'-l\ 2 d 2 u _, r _ 3 , 

as L — > 00. When uij depends on the spatial differences {uv .i-i — u/^-i} in the previous iteration through a nonlinear function 
/({itz'^-i — iti,i-i}), the two functions A(u) and B(u) in d2j depend on the state u(x, t). Otherwise, the two functions would 
be constants. The strength of coupling 7 corresponds to a positive parameter that is inversely proportional to L. Thus, we 
focus on the limit 7^0. 

Remark 1. An example that can be explained by the phenomenological system (O is the SC Curie-Weiss model H24V . However, 
the phenomenological system is not applicable to all SC systems. In fact, the DE equations for spatially-coupled LDPC ( SC- 
LDPC) codes over the BEC do not satisfy the positivity condition of the coupling function B(u) in (0, as noted in H25V . 
More precisely, the coupling function B(u) becomes zero at a certain point, whereas B(u) is positive at the other points. This 
implies that a higher-order spatial expansion is required for understanding the SC-LDPC codes over the BEC. 

Let us consider the uncoupled system 7 = 0. In this case, the partial differential equation (|2) reduces to the ordinary 
differential equation 

Since the state u(x,t) does not depend on the spatial variable x anymore, we re-write it as u(t) for the uncoupled system. It 
is straightforward to find 

jV(u(t)) = -{V'{u{t))Y < 0, (6) 

where the equality holds if and only if V'(u(t)) = 0. This implies that the energy V(u(t)) monotonically decreases with the 
time-evolution of the state u(t). Since the potential has been assumed to be bounded below, the state u(t) converges to a 
(local) minimum of the potential V(u) as t —> 00. 

Suppose that the potential V{u) depends on a positive parameter (3, and that the shape of V(u) as a function of u changes 
with the increase of (3, as shown in Fig. [T] The potential V(u) is assumed to have the unique stable solution for small f3. As 
j3 increases across a critical value of f3, denoted by Pbp, a metastable solution emerges to the left side of the global stable 
solution. The BP threshold for the uncoupled system 7 = is defined as the supremum of f3 t h such that the state converges 
to the rightmost stable solution as t — > 00 for all j3 £ (0,/?th)- When the initial state is smaller than the infimum of the 
unstable solution of V(u) over all /3 > j3 BP , the BP threshold is equal to the critical value /Jbp such that the potential V(u) 
is monostable (resp. bistable) for all f3 < /?bp (resp. j3 > /3bp)- In fact, the state for the uncoupled system 7 = is trapped 
in the left stable solution for > /?bp> since the initial state is smaller than the unstable solution of V(u), whereas it can 
arrive at the rightmost stable solution for /3 < /?bp- Spatial coupling allows the state to escape from the left stable solution 
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and to arrive at the right stable solution for j3 € (/?bp, /Jpfp?" 1 ), in which ft^p^ will be specified shortly, whereas the state may 

(so 

be trapped in the left stable solution for p > /3g P . The goal of this section is to elucidate the mechanism of escaping from 
the left stable solution and to specify the position of /3g P . We hereafter focus on the case p > /3bp, in which the potential 
V(u) is bistable. 

Let Mi and u t denote the left (smaller) and right (larger) stable solutions of V(u), respectively. The boundaries of the state 
u(x,t) are assumed to be fixed to the right stable solution, i.e. u(0,t) — u(l,t) = it r for all t > 0. Furthermore, we impose 
an initial condition u(x, 0) = uo(x), with some function uq{x). The state u(x,t), governed by ©, moves around in a space 
of functions on [0, 1] as t increases. If uq(x) is smaller than the unstable solution of V(u) for any x € (0, 1), the state for 
the uncoupled system 7 = is trapped in u(x) = u\ for all x £ (0, 1). Why can the state escape from the left stable solution 
for SC systems? When can the state arrive at the right stable solution w r for all x as t — > 00? Our phenomenological study 
provides answers to these questions. 

In order to answer the former question, we first transform the partial differential equation (O into a canonical system. Let 
f(u) denote a monotonically increasing function that satisfies f'(u) = exp{D(w) + (\nB(u))/2}, in which the function D{u) 
is given by the indefinite integral 

, . f A(u 



B(u) 



du. 



(7) 



It is straightforward to confirm that, by the transformation u 



du 
~dt 



-V'(u) + 7 2 



-\B'[u) 



f{u), the partial differential equation (|2j is transformed into 

du 



<9u x 2 
dx 



d_ 



B{u) 



dx 



with 



v'{u) = nf-\u))v'{r 1 {n)). 

B(u)=B(f- 1 (u)). 



(8) 



(9) 
(10) 



Thus, we can assume A(u) = —B'(u)/2 in (|2), without loss of generality. The partial differential equation (O with A(u) = 
—B'(u)/2 is referred to as the canonical system in this paper. The property f'(u) > implies that the stability of the potential 
energy is invariant under the transformation of variable, whereas the shape of the potential may change qualitatively. Note that 
the argument below is based on the shape of the potential after the transformation of variable. 
The canonical system <J2J with A(u) — —B'(u)/2 can be represented as a gradient system 

-(«), (11) 



where the energy functional H (u) is given by 



H(u) 



dt 



V(u(x,t)) + 



Su 



l 2 B(u(x,t)) 



du 

dx 



dx. 



(12) 



In ( fTTI ). 5/5u denotes the functional derivative with respect to u. 
same appendix, it is straightforward to find 



See Appendix lAl for the derivation of (fTTI . As shown in the 



dH(u) 
dt 



5H 



(u(x,t))\ dx<0, 



(13) 



where the equality holds if and only if SH/5u = 0. This implies that the energy functional (fT2l monotonically decreases with 
the time-evolution of the state. Since (fT2l is bounded below, H(u(x,t)) converges to a finite value as t — > 00. Thus, the state 
u{x, t) is guaranteed to converge to a stationary state u(x) as t — > 00, which is a local minimum of the energy functional (TT2l . 
It is obvious that the uniform solution u(x) = u r is a stable stationary solution to (fTTT i. since the boundaries are fixed to the 
right stable solution u T . The second term in the integrand of (fT2l smooths the state u(x,t) spatially. This smoothing effect 
helps the state escape from the left stable solution and move toward the uniform solution for all x e [0, 1]. The smoothing 
effect will be shown to work even in the limit 74O. 

We next elucidate the answer to the latter question. The following theorem presents a partial answer to the question. 

Theorem 1. Suppose that A{u) = —B'(u)/2, and that the boundary is fixed to the right stable solution u T of the potential 
V(u). If and only if u T is the unique global stable solution ofV(u), the uniform solution u(x) — u T is the unique stationary 
solution to the canonical system (0 in the limit 7 — > 0. 



Proof: See Section IlLBl ■ 
As shown from (1131 . the state u(x,t) converges to a stable stationary solution as t — > 00. Combining this observation and 
Theorem Q] implies that the state u(x, t) converges to the uniform solution when t tends to infinity and then 7 tends to zero, if 
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u r is the unique global stable solution of the potential V(u). In other words, the state u(x,t) can escape from the left stable 
solution and arrive at the right stable solution u T for all x, when u x is the unique global stable solution of the potential V(u). 

Recall that the shape of the potential V(u) with a positive parameter j3 is assumed to change with the increase of (3, as 
shown in Fig. Q] The BP threshold for the SC system is defined as the parameter /3th such that the state u(x, t) converges 
to the uniform solution for (3 < fth> whereas it is trapped in a non-uniform stationary solution for f3 > /3th- Suppose that 
V(ui) = V(u T ) holds at (i = /^ p C) . Theorem [TJ implies that /3< p C) is a lower bound on the BP threshold, since the state 

(SC) (SO 

is guaranteed to converge to the uniform solution for p < /3g P . We hereafter refer to /3 BP as the phenomenological BP 
threshold. 

It depends on the initial condition u(x, 0) = uq(x) whether /3 B p is equal to the BP threshold. In other words, it depends 
on the initial condition whether the state converges to a stable non-uniform solution when the non-uniform solution exists. For 
example, let us consider the initial condition uq(x) — u T . Even if there is a non-uniform stationary solution to © the state 
u{x,t) never converges to the non-uniform solution as t — > oo, since the initial state is a stable stationary solution to (0. 
When the initial state is smaller than the left stable solution u\ in the bulk region far from the boundaries, on the other hand, 
the state is expected to converge toward a stable non-uniform stationary solution if the non-uniform solution exists. The latter 
situation may arise in communication systems, since iterative detection algorithms commonly start with an unreliable initial 
state. Unfortunately, we could not prove the convergence toward a stable non-uniform stationary solution under the latter initial 
condition. If we could prove it, we would be able to present the complete answer to the question: When can the state arrive 
at the right stable solution u r for all x as t —> oo? 

Remark 2. The condition presented in Theorem^ i.e. the global stability of the potential V(u) at u = u T may not be necessary 

for strictly positive 7, whereas it is sufficient for any 7 > 0: The uniform solution may be the unique stable stationary solution 

(SO 

for strictly positive 7, when /3 is slightly larger than /3 BP . Let us consider the limit 7 — > 00 to present an intuitive understanding 
of this statement. The first term in the integrand of M2\ is negligible in the limit 7 — > 00. Thus, the energy functional ( 1721 ) has 
the unique stable solution that satisfies du/dx — and u(0) = u(l) = u r or equivalently u(x) = u r . This observation implies 
that for sufficiently large 7 the state converges to the uniform solution as t — >• oo, regardless of /3. The smoothing effect with 
7 > 0, given by the second term in the integrand of ( 1721 ). helps the state converge to the uniform solution for /3 slightly larger 
than P^f\ 



B. Proof of Theorem Q] 

In order to prove Theorem[T]we transform the canonical system (O with A{u) — —B'(u) /2. Let g(u) denote a monotonically 
increasing function that satisfies 

g\u) = y/B@}. (14) 
Letting u — g(u) transforms the canonical system (O with A(u) = —B'(u)/2 into 



—— = B(u) 
dt v ; 



d 2 u V'(u) 



dx 2 y/B@j 



(15) 



with u = g The newly introduced variable u corresponds to the normal coordinate system in differential geometry 

Let us introduce an effective potential energy function U (u) that satisfies 

U'(u) . -£M (16, 

with u = g _1 ({t). It is straightforward to confirm U(u) = V(g^ 1 (u)) + C, with a constant C. Note that u r — g(u r ) is the 
global stable solution of the effective potential U(u) if and only if u T is the global stable solution of the original one V(u). 

We first prove the sufficiency of Theorem Q] i.e. the uniform solution is the unique stable stationary solution if w r is the 
unique global stable solution of U(u). The following result is valid for any 7 > 0. 

Theorem 2 (Takeuchi et al. 2012). Suppose that A(u) — —B'(u)/2, and that the boundary is fixed to the right stable solution 
u r of the potential U(u). Ifu r is the unique global stable solution of the effective potential U(u), the uniform solution u(x) = u r 
is the unique stationary solution to ( I75I ). 

Proof of Theorem^ We follow ll25l to prove Theorem |2] A stationary solution u(x) to ( fT5l ) satisfies 
with the boundary condition u(0) = u(l) = u T . Integrating ( fTTT i after multiplying both sides by du/dx, we obtain 

t(I) 2 ^<"» + c - ™ 
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Fig. 2. Inverted potential —U(u). 



with a constant C. Since u r is the global stable solution of U(u), the boundary condition u(0) = u(l) = u r and the positivity 
of the LHS on (TT8T > imply C > —U(u T ). Let us prove C = —U(u T ), The stationary solution u(x) must be continuously 
differentiable since it is a solution to the second-order differential equation (fTTI i. From Rolle's theorem, there is a point 
.To £ (0,1) such that <iu/<ia;| 2:=a ; = 0. Evaluating ([Tol l at x = xq yields U(u(xq)) = —C < U(u T ). Combining this result 
with the global stability of u T , i.e. U{u{xq)) > U(u T ), we obtain C = —U(u(xq)) = — U(u T ). Note that the uniqueness of the 
global stable solution implies u(x ) — u T . 

We shall show that the uniform solution u(x) = u r is the unique solution to (TT~8T > with C = — U(u r ). Since u(0) = u(xq) = 
u(l) = u t , from Rolle's theorem, there are two points x on £ (0, x ) and Xoi £ (xo, 1) such that du/dx\ x — xao = du/dx\ x=X01 = 
0. Furthermore, ([18} with C = —U(u T ) and the uniqueness of the global stable solution of U(u) imply u(xoo) = u(xqx] = u T . 
Repeating this argument, we find that the derivative of u(x) is zero at countably infinite points. Since du/dx is continuous, 
this observation implies du/dx — for all x. Thus, the uniform solution u(x) = u r is the unique solution to JT8l or (fTTT i. ■ 

Remark 3. Hassani et al. H24V presented an intuitive argument based on classical mechanics, and obtained results equivalent 
to Theorem [2] We shall review their intuitive argument. The differential equation M7\ is regarded as the Newton equation 
of motion: The state u(x) is regarded as the position of a particle with mass at time x, moving subject to the potential 
energy —U{u). Note that x corresponds to the temporal variable in this interpretation, although x has been defined as the 
spatial variable in the original phenomenological system j2). Expression ( |i<SD corresponds to the conservation of mechanical 
energy. Since the Newton equation (fJ7| is invariant under time translation and under time reversal, the solution u(x) on [0, 1] 
is symmetric about the axis x = 1/2. Without loss of generality, we focus on u(x) for x € [0, 1/2]. 

The boundary condition u(0) = u t implies that the particle is on the right maximum of the inverted potential —U(u) at 
time x — 0. The uniform solution corresponds to the situation under which the particle continues to stay on the right maximum. 
Can the other solutions exist? In other words, can the particle move from the initial position and return to the initial position 
at time x = 1? Since u r is the global maximizer of the inverted potential —U(u), the conservation of mechanical energy 
implies that the velocity u'(x) at time x = 1/2 must be non-zero if the particle moves to some position u{\/2) ^ u r . From 
the symmetry of u{x) about the axis x = 1/2, however, the non-zero velocity v!{x) ^ at time x = 1/2 indicates that 
u{x) is non-differentiable at time x = 1/2. This is a contradiction, since u{x) is the solution to the second-order differential 
equation A17h Thus, it is impossible for the particle to move from the initial position and to return to the initial position at 
time x = 1. These observations imply that the uniform solution u(x) = u T is the unique solution to M7\ . 

Theorem [2] implies that the sufficiency of Theorem Q] is correct. We next prove the necessity of Theorem [TJ i.e. there is 
a stable non-uniform stationary solution in the limit 7 — > 0, if u T is the metastable solution of U(u). Let us focus on a 
non-uniform stationary solution u{x) to ( fT3T > that satisfies du/dx < (resp. du/dx > 0) for x € (0, 1/2) (resp. x £ (1/2, 1)). 
The following theorem guarantees the existence of such a stable non-uniform stationary solution in the limit 7 — > 0. 

Theorem 3. Suppose that A{u) = —B'(u)/2, and that u r is the metastable solution of the effective potential U{u). Let us 
define u\ and u un as u\ = g(u\) and the point u un € ({ti,{t r ) satisfying U(u un ) — U(u r ), respectively (See Fig. 0. For 
sufficiently small 7 > 0, there are two non-uniform stationary solutions u s (x) and w un (x) to ( |75| ). Furthermore, one stationary 
solution u s (x) converges to u\ for x £ (0, 1) in the limit 7 — >■ 0. The other stationary solution u nn (x) converges to u T for 
x 7^ 1/2 in the limit 7 — > 0, whereas M un (l/2) tends to u un in the limit 7 — > 0. In particular, u s (x) is stable in the limit 
7^0. 

Proof of Theorem \3] See Appendix [B] ■ 
Remark 4. The two solutions u s (x) and u un (x) in the limit 7—5-0 can be interpreted in terms of classical mechanics as 



IEEE TRANSACTIONS ON INFORMATION THEORY. VOL. , NO. , 2012 



follows: For u s (x), a particle starts rolling down from the right maximum of the inverted potential —U(u) toward the left 
maximum at time x — 0. The velocity is infinitely large, when the mass is infinitely small. In a moment, the particle approaches 
the left maximum of the inverted potential with vanishing velocity. If the approaching velocity were finite, the particle would 
pass through the left maximum and roll down the left cliff. At time x — 1/2, the particle starts rolling down from the left 
maximum toward the right maximum with infinitely small velocity. Just before time x — 1 the velocity becomes infinitely large, 
and returns to the right maximum of the inverted potential at time x = 1. 

For the other solution u un (x), a particle starts rolling down from the right maximum of the inverted potential to the left side 
with infinitely small velocity at time x — 0. Just before time x — 1/2, the velocity of the particle becomes infinitely large, and 
stops at the point {t un at time x = 1/2, because of the conservation of mechanical energy. If the initial velocity were finite, 
the particle could not stop at the point u un . The particle starts rolling down from the point u un with infinitely large velocity, 
and approaches the right maximum with vanishing velocity at a moment. Then, the particle climbs the hill slowly, and returns 
to the right maximum of the inverted potential at time x = 1. 

Theorem[3]is useful for plotting the stationary solutions to with A(u) = —B'(u)/2. Intuitively, the non-uniform solution 
u s {x) that converges to u\ for all x E (0, 1) in the limit 7 — > represents the situation under which the state u(x, t) is trapped 
around the left stable solution u\ as t — >• 00. We conjecture that u un (x) is unstable, although we could not prove the instability. 

III. System Model 

A. Sparsely-Spread CDMA Systems 

We introduce conventional synchronous SCDMA systems before presenting SC-SCDMA systems. In this paper, the receive 
power is assumed to be identical for all users. Let K and N denote the number of users and the spreading factor, respectively. 
Without loss of generality, we focus on one symbol period. User k sends the product of the unbiased binary phase shift keying 
(BPSK) data symbol b k E {—1, 1} and a sparse spreading sequence s k = (si,fc, ■ ■ • , sjv,fc) T with Ck — E[||sfe|| 2 ], in which the 
statistics of {sk} will be defined shortly. Under the assumption of unfaded channels, the received vector y = (yi, . . . , i/n) T G 
M. N is given by 

V = ^2 —7=s k b k + w, (19) 

fce/c V Ck 

with JC = {1, ... ,K}. In ( fT9l , the A^-dimensional vector w ~ A/"(0, <r 2 /jy) denotes additive white Gaussian noise (AWGN) 
with variance cr\. The expression ( fT9l can be re-written as 

y = Sb + w, (20) 

with S — (c 1 1 s\, . . . , c^ 2 sk) and b = (61, . . . , bx) T ■ 

The conventional CDMA systems use dense spreading sequences whose elements are all non-zero. In the SCDMA sys- 
tem ( fT9l , on the other hand, user k utilizes the sparse spreading sequence s k with c k (<C N) non-zero elements. The number 
Ck is equal to the weight (number of non-zero elements) of the fcth column of the spreading matrix S. For simplicity, we assume 
sparse spreading with binary antipodal chips as non-zero chips: Non-zero elements of s k take ±1 with equal probability. Then, 
the normalization constant c k = E[||sfc|| 2 ] is equal to the average column weight of S. Let r n denote the nth row weight for 
the spreading matrix S. A constraint with respect to the number of non-zero elements imposes Y^,k=i Ck = ^2n=i r «- ^ n tn i s 
paper, we only consider regular and quasi-regular ensembles of the spreading matrix. 

Example 1 (Regular Ensemble). In the {c 1 r)-regular ensemble of S, all column weights {c k } and all row weights {r n } are 
equal to c and r, respectively. The constraint 53fcLi c k = XmLi r n implies that K and N must satisfy the constraint on the 
system load f3 = K/N = r/c. One regular spreading matrix is obtained as follows: 

1) Pick up a matrix from all possible binary (0 or 1) matrices with row weight r and column weight c uniformly and 
randomly. 

2) Replace each non-zero element of the obtained binary matrix by -iic" 1 / 2 independently and with equal probability. 
The (c,r)-regular ensemble of the spreading matrix is composed of all possible spreading matrices obtained in the above- 
mentioned manner. 

The (c, r) -regular ensemble is well-defined when K and N satisfy the constraint on the system load (3 — r/c. The following 
r-quasi-regular ensemble is well-defined for any K and N. 

Example 2 (Quasi-Regular Ensemble). In the r-quasi-regular ensemble, all row weights {r k } are fixed to r. Let N w = rN 
denote the number of non-zero elements in S. The column vectors of S are classified into two groups with column weights c = 
\_N W /K\ and c + 1: One group consists of (N w — cK) column vectors with weight c + 1. The other group is composed 
of the remaining column vectors with weight c. The average column weight c is given by c = r//3. It is straightforward 
to confirm that the constraint N w = X)fc=i c fc is satisfied. One spreading matrix is obtained as follows: A binary matrix is 
uniformly and randomly picked up from all possible binary matrices satisfying the conditions above. Subsequently, the gain 
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Fig. 3. An example of the factor graph picked up from the 2-quasi-regular ensemble for K = 8 and N = 6 (left). The graph on the right represents a 
simplified graph representation for the same ensemble. 

±c -1 / 2 is associated with each non-zero element of the obtained binary matrix independently and with equal probability. The 
r- quasi-regular ensemble of the spreading matrix consists of all possible spreading matrices obtained in this manner. 

Obviously, the r-quasi-regular ensemble reduces to the (rN/K, r)-regular ensemble when 7V W = rN is a multiple of K. 
Thus, we only consider the quasi-regular ensemble in this paper. 

B. Factor Graph Representation 

We next introduce the factor-graph representation [fl"4l for the SCDMA system ( fl9] >, shown in Fig. [3] Each data symbol bk 
corresponds to a variable node represented by a circle, whereas each received signal y n is associated with a function node shown 
by a square. If the nth chip s n k for user k is non-zero, there exists an edge connecting function node n and variable node k 

1/2 

in the factor graph. Furthermore, the edge is associated with the corresponding gain c k s n ,k- Let dk C N = {1, . . . , N} 
denote the neighborhood of variable node k, i.e. the set of function nodes that are directly connected to variable node k. The 
degree \dk\ of variable node k corresponds to the fcth column weight of the spreading matrix S. Similarly, let dn C JC 
denote the neighborhood of function node n. The degree \dn\ of function node n is equal to the nth row weight r n of the 
spreading matrix. 

There is a one-to-one correspondence between a spreading matrix and a factor graph. Thus, we can consider an ensemble 
of factor graphs corresponding to the r-quasi-regular ensemble of spreading matrices. This ensemble is referred to as the 
r-quasi-regular ensemble of factor graphs in this paper, or simply as the r-quasi-regular ensemble if it is obvious that the 
ensemble is an ensemble of factor graphs. 

The crucial property of factor graphs picked up from the r-quasi-regular ensemble is the asymptotic cycle-free (ACF) 
property in the large-system limit, where K and N tend to infinity while the system load j3 = K/N is kept constant. The 
length of a cycle is defined as the number of edges included in the cycle. A factor graph picked up from the r-quasi-regular 
ensemble has no cycles with finite length with probability one in the large-system limit lfl4l . This ACF property guarantees 
the convergence of iterative detection based on BP in the large-system limit. 

C. Spatial Coupling 

In this section, we present an intuitive explanation for spatial coupling as a brief introduction, instead of presenting a precise 
definition of SC-SCDMA systems. The precise definition will be presented in the next section. We use a simplified graph 
representation for the r-quasi-regular ensemble (See the graph on the right in Fig. 0). The graph consists of one function node 
represented by a square and of one variable node shown by a circle. The number of edges is equal to the degree r of the 
function nodes in the original factor graph. In other words, the simplified graph only represents the fact that the degree of the 
function nodes is equal to r. In introducing SC-SCDMA systems, multiple simplified graphs are used. Thus, we refer to each 
simplified graph as a subgraph. An SC-SCDMA system with coupling width W and the number of subgraphs L is constructed 
from two operations with respect to the simplified graph representation. 

1) Generate L uncoupled subgraphs. Different subgraphs may have different spreading factors. 

2) Disconnect Wr/(W + 1) edges out of r edges from the variable node at position I for I = 0, . . . , L — 1. Subsequently, 
reconnect r/(W + 1) edges out of the disconnected edges to the variable nodes at position (I')l for I' — I — W, ... ,1 — 1. 
The subgraphs are connected circularly (See Fig. |4)- 

The degree of the function nodes r is restricted to a multiple of W + 1. In Step [2] L subgraphs have been coupled circularly, 
whereas Kudekar et al. Il20l considered termination at both ends. The point of spatial coupling is that the data symbols at both 
ends are known to the receiver or can be detected well. Reliable information about the data symbols at both ends is expected 
to spread over the whole system by spatial coupling. In order to allow the receiver to detect the data symbols at both ends, we 
reduce the system loads for positions I = 0, . . . , W — 1. In the next section, we present the detailed definition of SC-SCDMA 
systems. 
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Fig. 4. Spatial coupling for r = 3, L = 9, and W = 2. 



D. Spatially-Coupled SCDMA Systems 

MUD for SC-SCDMA systems is performed for every L symbol periods^], whereas it is done for every symbol period in 
the conventional SCDMA system ( fT9] i. This implies that the detection delay increases linearly in L for SC-SCDMA systems. 
In practice, the detection delay does not necessarily result in the overall delay for coded systems: If L is smaller than the code 
length, the overall delay is dominated by the delay due to decoding. 

Let Ni denote the spreading factor in symbol period I. The received vector y l = (yxj, . . . , yN l ,i) T S R^' in symbol period I 
is given by 



EE 



1 

y/Cl,k,l' 



h(i-i') L si t k,i'bk,v + wi, 



(21) 



for / € C = {0, . . . , L - 1}, with 



1/VW + l fml = Q,...,W 
forZ = W+l,.. 



,L-l. 



(22) 



In ( EH . the vector wi ~ jV(0, ct^J-Ni ) denotes the AWGN vector with variance a\. The Ni -dimensional vector si k,v = 
(si,i,k,i', ■ ■ ■ , SN l ,i,k,i') T represents the I'th sparse spreading sequence of user k for symbol period I, which will be defined 
shortly. The normalization constant ci : k : v is given by ci : k,v = ^[\\si,k,i'\\ 2 }- Finally, b^.i G { — 1, 1} denotes the Ith BPSK data 
symbol for user k. Let b; = (61, j, . . . , bK,i) T and Si t i> = {cfx {f si i f , . . . , cfj/f,si t K,l')- The system ( |2TT > can be represented 
as 



(23) 



y " 




' b ' 




w 


- 'g 




+ 




Vl-i_ 








wl-i 



-Temporal coupling may be an appropriate naming, rather than spatial coupling. Nonetheless, we use the term "spatial coupling" in this paper, following 

ED- 
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with 



Sq.o 



So,w • • • So,i 
Sw-i,w 



>w,w 



(24) 



Sl-i,w Sl-i,o 

It is straightforward to confirm that the simplified graph representation in Fig. |4] corresponds to the one for the SC-CDMA 
system with r = 3, L = 9, and W = 2. 

In Section IIII-C1 we reduced the system loads for positions I = 0, . . . , W — 1, compared to the other positions. This 
corresponds to the situation under which the SC-SCDMA system OTT) has two phases: initialization and communication 
phases. The spreading factors Ni for the initialization phase I = 0, . . . , W — 1 are fixed to a large value Ni n a to allow the 
receiver to detect the data symbols transmitted in the phase. On the other hand, the spreading factors Ni for the communication 
phase I = W, . . . , L — 1 are set to a small value N to increase the sum rate. 

The average system load /3 of the SC-SCDMA system is given by 

f- KL 



JVinitW + N(L - W) 
1 



(25) 



prJ t (W/L) + p-i{l-(W/L)y 

where /3j n ; t = K/N-^ an d P = K/N denote the system loads for the initialization and communication phases, respectively. 
The average system load ( |25T l converges to the system load f3 for the conventional SCDMA system ( fT9] l when L and W tend 
to infinity with 7 = W/L — > 0. 

We only consider the quasi-regular ensemble with spatial coupling. Throughout this paper, the matrix (l24l is assumed to be 
drawn from the (r, L, W / )-quasi-regular ensemble below. 

Example 3 (Quasi-Regular Ensemble with Spatial Coupling). In the (r, L,W)-quasi-regular ensemble with spatial coupling, 
all row weights of the matrix ( 1241 ) are fixed to r. Each non-zero submatrix Siy has the row weight r/(W + 1) for all rows. 
Thus, the row weight r of \24\ must be a multiple ofW+1. Each submatrix Si,i' is a member of the r/(W + l)-quasi-regular 
ensemble with the number of users K and the spreading factor Ni, presented in Example [2] The average column weights 
{ci,k,i'} ore equal to ci,k,i' = r/{(W + l)/?init} for the initialization phase I = 0, W — 1 and ci,k,v = r/{(W + 1)/?} 
for the communication phase I = W, . . . , L — 1. The (r, L, W)-ensemble consists of all possible matrices A24\l that satisfy the 
conditions above. It is guaranteed that the (r, L,W)-ensemble of factor graphs corresponding to the (r, L 1 W)-ensemble of 
has the ACF property in the large-system limit, although its proof is omitted. 



IV. Iterative Receivers 

A. Belief Propagation 

The goal of the receiver is to compute the marginal posterior probability for each data symbol bk,v, given by 

P (h, lr \y,G)= ]T P (B\y,G), 

\h,v 

where the joint posterior probability p(B\y, G) of B = {6; : I E £} given y = {y l : I € £} and G is defined as 

P {y\G,B)p{B) 



with 



p(y\G) = Y,r{y\G,B)p(B). 



It is well-known that the IO decision b 



(26) 

(27) 
(28) 

(IO) _ 
k.V ~ 



2~2b k ,,e{-i,i} b k,i'P(bk,i'\y, G) minimize the bit error rate 



In (f27l), the conditional pdf p(y\G,B) represents the SC-SCDMA system 
argmax bfc [t e{-i,i}P(bk,i'\y , G) and the soft IO decision b^j^ 
(BER) and the mean-squared error (MSE), respectively 0]. 

BP is an iterative algorithm for computing the marginal posterior probability (l26l efficiently. Messages are exchanged between 
the variable and function nodes on the factor graph for the SC-SCDMA system (|2TT >. Let q^\{bk,v) (resp. m^\(bk.i')) denote 
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the message passed from the function node y n> i (resp. variable node bk,v) to the variable node bk,i> (resp. function node y n ,i) 
in iteration i. A tentative marginal posterior probability of bk,v in iteration i is given by the product of all incoming messages 
to the variable node bk,i>, 

P®{h,i>\y,G)<x J] (29) 

(n,f)eS(fc,i') 

where the set of index pairs d(k, V) C A/" max x £, with Af max — {1, . . . , max; Ni}, denotes the neighborhood of the variable 
node bk,i'- The BP decision is defined as the hard (or soft) decision based on the marginal posterior probability (|29l in each 
iteration. 

The messages q^\(bk,i') and m^\(bk,i') are updated as follows: 



1n}(h,l') = E p I " ' ' 
\K,i> 



E 



(h,v) 



(30) 



(31) 



n <t i \\vi 

(k,l')ed(n,l)\(k,l') 

Y n,l,k,l' IT 

{fi,l)ed(kM)\{n,l) 

with the initial values m^°] ) = 1/2. In d30l >. the set of index pairs d(n,l) C t x £ denotes the neighborhood of the 
function node y n j. Furthermore, the conditional pdf in (|30l represents the SC-SCDMA system (fJJJi for the nth received signal 
in symbol period I. In Pit . ot£\ k denotes the normalization constant. We hereafter refer to the update rules ( f30b and (l3TT > 
as the sum and product steps, respectively. 

It is known that BP computes the exact marginal posterior probabilities if there are no cycles in the factor graph [13]. 
When the factor graphs have cycles, the convergence of BP is not guaranteed in general. Even if BP has converged, the 
computed marginal posterior probabilities (|29l are approximate. Fortunately, the (r, L, M / )-ensemble of factor graphs has 
the ACF property in the large-system limit: There are no cycles with finite length in the large-system limit. Thus, the BP 
receiver ((29) is guaranteed to converge in the infinite-iteration limit i — > oo after taking the large-system limit. Note that the 
two limits do not commute with each other. 



B. Gaussian Approximation 

The computational complexity of the BP receiver is exponential in the row weight r, whereas the complexity is linear in K 
and L. Consequently, a large row weight r cannot be used in terms of the complexity. We derive an approximate BP-based 
iterative receiver that works efficiently for large r, following fTBI . The update rule (|3Q| > can be regarded as a marginalization 
with respect to the independent variables btj, ~ m ni(^kl') ^ or au> (^>0 ^ d(n,l)\(k,V). Let us define the postulated 



interference to the data symbol bh,v in the function node y n< i as 



I 



(0 



E 



(i) 



(k,l')ed(n,l)\(k,l>) "J + l) c i,fc,Z~' 



(32) 



with b\ ? ~ m 

kV 



n\(^k I')- ^ e cen tral limit theorem implies that the interference (l32l converges in law to a Gaussian random 



variable in the limit \d(n, l)\ = r — > oo. The mean jjy\ . and variance v£\ k of d32l are given by 



E 

(k,I')£d(n,l)\(k,l<) 
,,2 



(W + l)c, 



,-,(*) - v 

n,l,k,V ~ / , 

(k,l')ed(n,l)\(k,l') 



„Lk,l 



l,k,V 
^ n ,l,k,V> 



(W + l)c Lk r, 



(33) 



(34) 



respectively. We use the GA of d32i > to approximate the update rule (f30b by 

Sn,l,k,l'bk,l 



=%(*-!) 



+ 7 (i - 1} 
' 1 n,l,k,l' 



y/(W + l)ci, k ,l 

S n J,k,l'bk,l> , 2 

^nJW'i V n,l,k,l' ' a n 



(35) 



TAKEUCHI et a/.:PERFORMANCE IMPROVEMENT OF ITERATIVE MULTIUSER DETECTION FOR LARGE SPARSE CDMA BY SPATIAL COUPLING 



13 



where g(x 1 a 2 ) denotes the pdf ([TJ for a zero-mean Gaussian random variable with variance a 2 . The complexity of the BP 
receiver with GA ((35} is linear in the row weight r, as well as in K and L. 



V. Main Results 

A. Density Evolution Analysis 

The asymptotic properties of the BP receiver (129} are analyzed in this section. Let us define the equivalent channel between 
bk,i> and the corresponding output in iteration i, denoted by b^\,, as 



p©l&k,i') = fp®(pk,i> = b^ v \y,G) P (y\G,b kiV )dy, 06) 

where the overline represents the expectation with respect to G. The BER and SIR are calculated from the equivalent 
channel ( f36b . 

We consider four limits: the large-system limit, the dense limit r — > oo, the infinite-iteration limit i — > oo, and L, W — > oo 
with 7 = W/L — > 0, which will be taken in this order. We first present the main result in the first two limits, i.e. in the 
large-sparse-system limit where the dense limit r — > oo is taken after the large-system limit. The main result is that the 
equivalent channel (136} converges to the one for a scalar AWGN channel in the large-sparse-system limit. We first introduce 
the equivalent AWGN channel for iteration i, 

43' =&*,!' +4?!'. < 37 > 

with Wk,i> ~ A/"(0, (sir^) -1 ), in which sir ; , will be defined shortly. Let (ph,l')i denote the posterior mean estimator of bk,v 
in iteration i, 

(6fc,i')i= E &JWKM4*,]'), (38) 

where the posterior probability p(&fe,|'l4*]') in iteration i is defined as 

, (,•), P(Zk}>\b k ,i>)p(bk,i>) 

P(h,l'\ z k,l') = 77*71 ' (39) 

P(4'l>) 

with 

k45')= E p(4?i'i&*,i')p(&fc,i')- ( 4 °) 

The MSE £,y for the posterior mean estimator (138} in iteration i is given by 

#=E[(6 fci ,,-<6 fc , r ) i ) 2 ]. (41) 

Theorem 4. Suppose that A24\l is picked up from the (r, L, W)-ensemble, presented in Example \3\ Then, the equivalent 
channel ( 1361 ) for the BP receiver in iteration i converges to the equivalent channel for the scalar AWGN channel ( 1571 ) in the 
large-sparse-system limit: 

pft&M [pQbw = &iSi4:l')p(4Si^')d41'- (42) 



/« ( 1421 ), fne posterior probability p(6fc,j'l4*l') .? !ven ^ (O- conditional pdf p(zj^ { ,\bk,i') represents the scalar AWGN 
channel ( 1571 ) in iteration i. In evaluating these expressions, the asymptotic SIR sir^, is given Z?y 

+ 1 ' erf, , „, m 



i=0 (W)i7 



where {of (i)} satisfy the following equations. 



R W 
l'=0 

with 4° } = 1 far all V G C. In $44}, fa = K/Ni is equal to /3 init for I = 0, . . . ,W - 1 and to for I = W, . . . , L - 1, 
respectively. 

Proof: See Appendix Icl ■ 
The expressions (|43l and ( 144} determine the evolution of the asymptotic equivalent channel (142} with respect to i. Thus, 
they are referred to as DE equations. 
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As is obvious from the proof of Theorem [4] the GA for the postulated interference (|32t is exact in the large-sparse-system 
limit. Thus, we have the following result. 

Theorem 5. Suppose that \24\ is picked up from the (r, L, W)-ensemble, presented in Example\3\ Then, the equivalent channel 
for the BP receiver with GA converges to the asymptotic equivalent channel for the true BP receiver, i.e. the RHS of A42\l in 
the large-sparse-system limit. 

Proof: Repeat the proof of Theorem |4] ■ 
From Theorem [5] the performance of the BP receiver with GA is indistinguishable from that of the true BP receiver in the 

large-sparse-system limit. Thus, we hereafter focus on the true BP receiver. 

We next consider the infinite-iteration limit i — > oo. Numerical evaluation implies that the asymptotic SIRs (l43l monotonically 

converge toward a fixed-point of the DE equations (|43T > and (l44l as i — > oo. It is shown that the BP receiver can achieve the 

same performance as that for the (soft) IO receiver if the fixed-point is unique. 

Theorem 6. Suppose that K24\ is picked up from the (r, L, W)-ensemble, presented in Example\3\ If the fixed-point of the DE 
equations A43\) and A44\l is unique, the asymptotic SIR for the BP receiver converges to that for the ( soft) IO receiver in the 
infinite-iteration limit i — > oo after taking the large-sparse-system limit. 

Proof: We follow an argument based on a genie-aided BP receiver in ITT91 to prove Theorem [6] For a fixed number of 
iterations i, let yj?\, denote the set of the received signals {y n .i} utilized in the BP detection of bk,v- The data symbols are 

(i) (i) 

classified into two groups: the data symbols X^ j, that connect only to the function nodes in J/i and the remaining data 
symbols. The ACF property of the (r, L, W) ensemble implies that xj, l j, consists of the data symbols whose depth (distance 
from the root bk,v) is less than or equal to 2(i — 1). Furthermore, y^\, is composed of the function nodes that have depth less 

than 2i. The BP detection of bk.i> with the number of iterations i is equivalent to the soft IO detection of bk,i> based on y k v , 
whereas the soft IO detection of bk,v is based on the entire received signals y. We use a genie-aided BP receiver to obtain 
an upper bound on the asymptotic SIR of the soft IO receiver. Let a genie inform the BP receiver about the data symbols not 
in X k \, . Since information about the received signals not in y k \, is passed only through the known data symbols with depth 

2i, using the information y\yj? \, does not improve the performance of the genie-aided BP receiver. Thus, the soft IO receiver 
cannot outperform the genie-aided BP receiver. In other words, the asymptotic SIR for the genie-aided BP receiver provides 
an upper bound on that for the soft IO receiver. 

The performance of the genie-aided BP receiver can be evaluated in the large-sparse-system limit by repeating the proof 
of Theorem |4] The asymptotic SIR converges to (l43l . which are determined by the DE equations (l43l and (144-b . The only 
difference is that the initial condition is not t;^ = 1 but £,v = 0, because the data symbols with depth 2i are known to 
the receiver. Let us take the infinite-iteration limit i ^ oo. Since we have assumed that the DE equations have the unique 
fixed-point, the solution to the DE equations should converge to the unique fixed-point as i —> oo, regardless of the initial 
condition. This observation implies that the performance of the genie-aided BP receiver coincides with that of the BP receiver 
as i — » oo after taking the large-sparse-system limit. Since the performance of the soft IO receiver is sandwiched between the 
performance of the two BP receivers, the BP receiver can achieve the same performance as that for the soft IO receiver. ■ 



B. Threshold Analysis 

We have so far considered the three limits: the large-system limit, the dense limit, and the infinite-iteration limit. In this 
section, the remaining limits L — > oo and W — > oo are investigated. We start with the definition of the BP threshold. 

Definition 1 (BP Threshold). Let {sir[, opt '} denote the fixed-point of the DE equations rf?il ) and (O) that has the largest SIR 
at the middle point V / L = 1/2 among all possible fixed-points. The BP threshold is defined as the supremum of /3th such that 
the asymptotic SIRs {sir,, } converge to {sir[° pt ' ) } as i — > oo for all f3 € (0, /3th). 

For the SC-SCDMA system, reliable information about the data symbols should propagate toward the middle point I'/L = 
1/2. Thus, the asymptotic SIR at the middle point should be worst among those for all positions. Furthermore, the SIR 
sir^fg at me middle point is close to the SNR 1/ct„ when fi is below the BP threshold, as shown in Section I VII so that the 

SIRs {sir,, are close to l/cr^ at all positions. This implies that the BP threshold corresponds to a boundary between the 
interference-limited region and the non-limited region for the BP receivers. The BP receivers can mitigate the MAI well when 
(5 is below the BP threshold. 

If the DE equations ((43) and ( l44b have the unique fixed-point {sir,, }, the asymptotic SIRs converge to {sir;, } as 
i oo. Otherwise, the asymptotic SIRs are expected to be trapped in the other fixed-point, as noted in Section [XT] Thus, the 
BP threshold should be equal to the supremum of /3 t h sucn mat the DE equations d43l and ( |44T > have the unique fixed-point 

for all system loads f3 £ (0, /0th)- 
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In order to distinguish the BP threshold for the SC system from that for the uncoupled system, the one for the uncoupled 
system W — is denoted by /3b p- On the other hand, the BP threshold for the SC system in the limit L, W oo with 
7 = W/L -> is written as /3^ S p C) . 

~ (go 

Before evaluating the BP threshold /3g P for the SC-SCDMA system, we shall review the performance assessment of the 
soft IO receiver for the uncoupled dense CDMA system based on the non-rigorous replica method [10|, [12], and define the 
IO threshold. 

Proposition 1 (Tanaka 2002). The asymptotic SIR of the soft IO receiver for the uncoupled dense CDMA system with system 
load (3 converges to s in the large-system limit, in which s is a solution to the following fixed-point equation, 

1 



(45) 



F(s) = pC(s) + | [als 



where £(s) denotes the MSE for the posterior mean estimator of the BPSK data symbol transmitted through the scalar AWGN 
channel with SNR s. If the fixed-point equation has multiple solutions, the solution s is selected to minimize the free energy 

' ' ' Ha 2 a s) - 1] , (46) 

where C(s) denotes the input-output mutual information in nats for the BPSK-input scalar AWGN channel with SNR s. 

The fixed-points of the DE equations d43l > and d44i > for the uncoupled SCDMA system W = coincide with the solutions to 
the fixed-point equation d45l ). When the fixed-point equation (|45]> has multiple solutions, there is a difference in performance 
between the BP and IO receivers. In that case, the IO receiver can achieve the largest solution to d45l l if it is the global 
minimum of the free energy (|46l , whereas the BP receivers cannot. 

It is straightforward to find that the solution s to the fixed-point equation (|45l l corresponds to a stationary point of the 
free energy (PES! , with a general relationship between the mutual information C(s) and the MSE £(s) for the scalar AWGN 
channel (33) 

= (47) 

Korada and Montanari l40l proved that the minimum of the free energy (|46| | over s is equal to the sum capacity in nats for 
the uncoupled dense CDMA system with BPSK inputs in the large-system limit. Unfortunately, it is still open whether or not 
the asymptotic SIR for the soft IO receiver coincides with the solution s to minimize the free energy (|46| > when the fixed-point 
equation d45b has multiple solutions, although the non-rigorous replica analysis 11101 . |[T2l suggests so. 

The fixed-point equation d43T > has the unique solution for all system loads in the low-to-moderate SNR regime. On the other 
hand, it has multiple solutions for high system loads in the high SNR regim^E In other words, the free energy (l46l is bistable 
for high system loads, as shown in Fig. Q] We hereafter focus on the latter situation. 

Only the free energy (l46l at the solutions to (l45l l is used in Proposition Q] Consequently, there is ambiguity in expression 
for the free energy. We use this ambiguity to derive another expression of the free energy that is suitable for understanding 
the BP threshold for the SC-SCDMA system. Let us consider the free energy F(s) obtained by substituting d45b into s in the 
second term of i 



F(s) =[3C{s) 
=[3C(s) 

In the derivation of the last expression, we have used 
even if the free energy (l48l were used instead of 
Theorem Q] 



In- 



In 



1 



(48) 



again. The statement of Proposition [TJ would be obviously unchanged 
The free energy (l48l will be shown to correspond to the potential in 



Remark 5. The shape of the free energy ( I4.SI ) as a function of s is qualitatively the same as that of the original one 
fact, calculating the stationarity condition for ( I4<SI ) yields 



F'(s) 



1 



#00 



o. 



In 



(49) 



where we have used j47j . Since the MSE £(s) is a monotonically decreasing function of SNR s, the stationarity condition \49\ 
reduces to the original one \45\ . The values of the free energy at the stationary points coincide with those of the original 
one \46\ at the same stationary points. Furthermore, any two adjacent stationary points in the free energy are connected by a 
monotonic curve. These observations imply that the metastable, unstable, and global stable solutions to the free energy 
are equal to the corresponding solutions of the original one ( 1461 ), respectively. 



3 This is the definition of the term "high system load" in this paper. 



16 



IEEE TRANSACTIONS ON INFORMATION THEORY. VOL. , NO. , 2012 



We shall present the definition of the IO threshold for the uncoupled CDMA system. 

Definition 2 (IO Threshold). The IO threshold Ao for the uncoupled CDMA system is defined as the supremum of Ah such 
that the asymptotic SIR for the IO receiver is equal to the largest solution of the free energy \46\ or (I4<SI) for all ft £ (0, Ah)- 

Proposition Q] implies that the IO threshold Ao is equal to the system load j3 such that the free energy (l46t or (l48l has two 
global minima. It is obvious that the IO threshold Ao is larger than the conventional BP threshold /?bp- The IO threshold 
corresponds to a boundary between the interference-limited region and the non-limited region for the IO receiver. The IO 
receiver can mitigate the MAI well when /? is below the IO threshold. 

We move on to the evaluation of the BP threshold for the SC-SCDMA system. The following result implies that the BP 
threshold can be improved up to the IO threshold by spatial coupling in the limit L, W — > oo with 7 = W/L —> 0. 

Theorem 7. Suppose that there exists a smooth function s(x,t) that satisfies s(V / L,iAt) = sir,, with small At > for all 
I' G C in the limit L, W — ¥ 00 with 7 = W/L — ¥ 0. Let us define u(x,t) as u(x,t) = f(s(x,t)), given by 



/'"•= / s ~^ds. (50) 



Then, the variable u(x,t) for x £ (0, 1) satisfies the following partial differential equation: 



B(u) fdu\ 2 



9 (hi \ du 
to ( B{u) dx- 



dx J 

+ 0( 7 3 ,At 2 ), (51) 



in the limit L, W — > 00 with 7 = W/L — > 0. In ( 1571 ), the free energy F(s) is given by A48h Furthermore, the function B(u) 
is defined as B(u) — B(f~ 1 (u)), given by 

B w°- idrlwF >0 ' <52) 

where £(s) denotes the MSE for the posterior mean estimator of the BPSK data symbol transmitted through the scalar AWGN 
channel with SNR s. 

Proof: See Appendix ID"1 ■ 
The system ( T5TT > is a continuum approximation of the DE equations ( l43b and ( f44b . The spatial differences {sirf, — sir^ } are 
expected to be small in the limit L, W —> 00 with 7 = W/L — > 0, so that one can safely assume the existence of a spatially 
smooth function s(x,t) in Theorem|7] On the other hand, the temporal difference (sir,, — sirj, 1-1 ^) may be small except for 
first several iterations. Thus, the system (IBTt may be able to provide a good approximation for the DE equations ( |43l and (PBi l 
especially in large i. As seen in the proof of Theorem Q] the BP threshold for the system dBTl l, which is an example of the 
phenomenological system @, is characterized by properties of stationary solutions to (IBTt . Thus, the system (IBTT l is expected 
to provide a good prediction of the BP threshold for the SC-SCDMA system, although dBTl ) cannot explain the dynamics of 
the DE equations d43b and d44l) in initial iterations. 

We use Theorems Q] and [7] to characterize the position of the BP threshold j3%p for the SC-SCDMA system. Expressions (|43T > 
and (|44] > imply that siTq > (a^ + /?mit) -1 and sir^^ > (<j^ + Anit) 1 hold. In other words, we have the lower bounds 
u(0,t) > + Anit) -1 ) an d t) > + Anit) -1 ) for any t. In order to derive a lower bound on the BP threshold, 

we consider the system ( BTT l with the boundary condition u(0, t) = u(l, t) = u T , in which u r denotes the larger stable solution 
of Suppose that Anit is so small that f((a^ + Anit) -1 ) > u r holds. The BP threshold for the system (l5lT l 

with u(0,t) = u(l,t) = u T provides a lower bound on the BP threshold A3P • Let /3g P denote the phenomenological BP 
threshold at which the free energy (|48) has two global minima. By Definition [2] ft^p^ is equal to the IO threshold Ao- Since 
the phenomenological BP threshold is a lower bound on the BP threshold for the system (IBTT l with u(0, t) = u(l, t) = u T , as 
shown in Section |Ul we obtain the lower bound Ao = /3rp^ — Pbp on tne BP threshold for the SC-SCDMA system. 

Theorems Q] and Q imply that the DE equations ( |43l l and (l44l > have the unique fixed-point in the limit W, L — > 00 with 
7 — > if j3 is smaller than the phenomenological BP threshold A^p"^ ■ From Theorem |6] the BP receiver is optimal in the limit 
W,L->oo with 7 -> for (3 < Af p C) . 

Remark 6. We shall conjecture the position of the BP threshold /3g P for the SC-SCDMA system. The non-rigorous replica 

analysis presented in H25V implies that the IO threshold A^o^ f or ^ e SC-CDMA system converges to the conventional IO 

~(SC) (SO 
threshold Ao from above in the limit L, W — > 00 with 7 — > 0. The BP threshold p Bp is bounded from above by A Q > *° 

"(SO 

that we obtain /3g P < Ao ln tne limit L, W — >• 00 with 7 — > 0. Consequently, we can conclude 

Pbp = Pbp = Ao, (53) 
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Fig. 6. Multiuser efficiency at I'/L = 1/2 versus for \jo\ = 10 dB, L = 32, and /3 init = 0. 



if the replica analysis is correct. Thus, we hereafter refer to /3g P as the BP threshold for the SC-SCDMA system. 

(SO 

The convergence of /3j Q to j3\o implies that spatial coupling never improves the performance of 10 detection. In other 

words, spatial coupling should be regarded as a method for improving the performance of iterative detection. Unfortunately, 

(SO 

we can provide no rigorous proof for the convergence, which is intuitively understood as follows: The reason why Pjq is 
above P\q is due to the rate loss, which vanishes in the limit L, W — > oo with 7 — > 0. 



VI. Numerical Results 

A. Density-Evolution Analysis 

The DE equations d43l l and (l44t are numerically solved to estimate the position of the BP threshold for the SC-SCDMA 
system. We focus on the ME rff' — a^sir^ in iteration i. Since the asymptotic SIR sir ; , must be smaller than the SNR l/c„, 
the ME takes a value between and 1. Figure [5] shows the ME m, for (3 = 1.97 and ft = 1.99. The theoretical prediction 
/3^p C) of the BP threshold for the SC-SCDMA system based on Theorem □ is given by ^ S p C) « 1.98267 for l/o* = 10 dB. 
For B = 1.97 < p^ T\ the BP receiver first obtains reliable information about the data symbols at the boundaries V / L = and 
I'/L = 1 — 1/L, transmitted in the initialization phase. Then, the reliable information propagates toward the middle position 
I'/L = 1/2 as i increases. Eventually, the ME tends toward an almost uniform solution, which is close to 1 for all positions 
I'. This result implies that the BP receiver can eliminate the MAI for P — 1.97. For p = 1.99 > /3^?\ on the other hand, the 
ME tends to a non-uniform solution after many iterations: The ME for i = 10° is close to around the center I'/L = 1/2, 
whereas it is close to 1 near the boundaries I'/L = and I'/L = 1 — 1/L. This observation implies that the system is 
interference-limited for p = 1.99. 
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TABLE I 

BP THRESHOLDS FOR THE SC-SCDMA SYSTEMS. = 10 DB AND /3 init = 1. /3 BP AND /9g S jf ' ARE APPROXIMATELY GIVEN BY /3 BP fs 1.73078 

AND /3' S p C) « 1.98267, RESPECTIVELY. 





W 


1 


2 


3 


4 


L 


16 


1.97947 


1.99150 


2.04385 


2.16470 


32 


1.97925 


1.98266 


1.98321 


1.98665 


64 


1.97925 


1.98264 


1.98267 


1.98267 


128 


1.97925 


1.98264 


1.98267 


1.98267 



TABLE II 

BP THRESHOLDS FOR THE SC-SCDMA SYSTEMS. = 12 DB AND /3i n j t = 1. /3p,P AND /3gp ARE APPROXIMATELY GIVEN BY /3 B P ~ 1.87344 

AND /3' SC) Rj 2.50716, RESPECTIVELY. 





W 


1 


2 


3 


4 


L 


16 


2.38479 


2.49386 


2.53057 


2.65917 


32 


2.38479 


2.49314 


2.50589 


2.50726 


64 


2.38479 


2.49314 


2.50588 


2.50705 


128 


2.38479 


2.49314 


2.50588 


2.50705 



In order to investigate the accuracy of the continuum approximation, we focus on the fixed-points to the DE equations d43b 
and ( f44b . See lEOl for how to find the fixed-points to the DE equations d43l and ( f44b . Figure [6] shows the ME at the center 
I'/L = 1/2, given via the fixed-points to (f43b and (|44T >. The MEs for W = 1 and W — 2 are represented by the solid and dashed 
lines, respectively. The dotted line shows a theoretical prediction of the ME in the limit L, W —> oo with 7 = W/L —> 0, 
based on the continuum approximation. The prediction is obtained by combining Theorem [3] and Theorem [7] The theoretical 
prediction is indistinguishable from the ME for W = 2, The DE equations d43l and d44b have the unique fixed-point when f3 

niSC) (SC) 

is smaller than /3g P , which is shown by the vertical line, whereas there are multiple fixed-points for p > /3^ p . The ME for 
W = 1 displays oscillating behavior around the BP threshold /3 BP , as shown in the inset of Fig. [6] The same phenomenon 
was observed in SC-LDPC codes [20|. The wiggle decreases slightly the maximum of /3 at which the DE equations (03]) and 
d44i l for W = 1 have the unique fixed-point. Since the amplitude of the wiggle decreases rapidly with the increase of W, the 
ME for W = 2 is indistinguishable from the theoretical prediction shown by the dotted line, except for a neighborhood of 
/3 = /3 B p ■ This oscillating behavior around /JgjF^ seems to disappear in the limit L,W — » 00 with 7 = W/L — > 0. These 
observations imply that the continuum approximation is accurate in the limit L, W — > 00 with 7 = W/L — > 0. 

Tables U and III] list the BP thresholds for the SC-SCDMA system for SNRs \/o\ = 10 dB and \/o\ = 12 dB, respectively. 
The thresholds were estimated by solving the DE equations (l43l and d44b numerically. Note that the listed values are not 
the average system load /3 but the system load j3 in the communication phase. We find that the thresholds for L — 16 are 
larger than the theoretical prediction /3 BP rj 1.98267 in the limit L, W — > 00 with 7 = W/L — > 0, except for W = 1. This 
observation is due to strictly positive 7: When /3 is slightly larger than as noted in Remark [2] the partial differential 

equation d5T1 ) for 7 > may converge to an almost uniform solution, like the solution for i — 1800 in Fig. [5] 

We have so far investigated the static properties of the DE equations d43l and d44b . We next consider the dynamic properties 
of the DE equations. Figure Q shows the asymptotic ME m, at I'/L = 1/2 as a function of the number of iterations i. The 
conventional BP threshold /3 B p is approximately equal to /3bp ~ 1.73078 for SNR \/o\ = 10 dB, while the BP threshold for 

(SC) 

the SC-SCDMA system is given by pg P ps 1.98267. The number of iterations required for convergence increases as p grows. 
Interestingly, the SC-SCDMA systems converge to the stationary solutions more quickly than the uncoupled system W = 
for f3 = 1.73, whereas all systems converge at the same speed for f3 — 1.55 < /3bp- This observation is because the uncoupled 
system requires infinitely many iterations for convergence when /3 tends to the BP threshold /3bp from below. The number of 
iterations required for L = 64 and W = 2 is roughly half the number of iterations for L = 64 and W — 1, while the one 
required for L = 128 and W = 1 is roughly twice. These results are consistent with the intuition that reliable information 
at the boundaries I' = Q and V = L — 1 should propagate toward the middle position Z' = L/2 at a speed proportional to 
7 = W/L. 

B. Numerical Simulations 

Numerical simulations of the BP receivers are presented. We first focus on uncoupled SCDMA systems, i.e. W = 0. Figure [8] 
plots the BERs of the (exact) BP receiver. The results for r = 4, 6, 8 are denoted by {+}, {x}, and {□} connected with solid 
or dashed lines, respectively. The analytical results in the large-sparse-system limit are also shown by the dotted lines. When 
j3 = 1, the large-sparse-system result can provide a good approximation of the BER for r = 4. When j3 — 1.45, however, 
there are gaps between the large-sparse-system result and the BERs for r = 4, 6, 8 especially in the moderate-SNR regime. 
The large-sparse-system result provides a larger estimate than the actual BER in the low-SNR regime, whereas it predicts a 
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Fig. 7. Multiuser efficiency at Z'/L = 1/2 versus the number of iterations for l/<xj| = 10 dB and /3j n j t = 0. 



1 




UN in dB 



Fig. 8. BER versus SNR for K = 1000 and W = 0. The number of iterations is equal to 40. 



smaller BER in the high-SNR regime. These observations imply that, as the system load grows, larger r is required in order 
for the large-sparse-system result to provide good approximations for the actual BERs. 

Figure [9] shows a comparison between the SCDMA systems with and without spatial coupling. We used the BP receiver 
with GA to reduce the computational complexity. The horizontal axis is the average system load given by d25l l. The BER at 
the middle position I'/L = 1/2 is plotted for the SC-SCDMA system. The BERs for the uncoupled SCDMA systems are 
denoted by {+} or {x} connected with dashed lines. The BERs for the SC-SCDMA systems are represented by {+} or {x} 
connected with solid lines. We find that the performance of the SC-SCDMA system with K = 2048, L = 16, W — 1, and 
Anit = P is superior (resp. comparable) to that of the uncoupled SCDMA system with 2048 (resp. 32768) users. Note that the 
SC-SCDMA system performs the BP detection for every L symbol periods, whereas the uncoupled SCDMA system does for 
every symbol period. However, this delay of detection does not necessarily result in the overall delay for coded systems. Since 
L is commonly smaller than the code length, the overall delay is dominated by the decoding delay. Thus, the comparisons 
between the uncoupled SCDMA system with 2048 users and the SC-SCDMA systems make sense for practical coded systems, 
although the detection delay for the SC-SCDMA systems is larger than for the uncoupled SCDMA system. These observations 
imply that the SCDMA system with one-dimensional coupling can accelerate the convergence speed toward the large-system 
limit, compared to the uncoupled SCDMA system with the same number of users. Furthermore, the SC-SCDMA system with 
K = 2048, L = 16, W = 1, and Anit = 1-4 can provide a significant improvement in BER for high system loads, compared 
to the SC-SCDMA system with Anit = A The BERs for the SC-SCDMA system with Anit = 1.4 seem to be trapped by the 
top (bad) solution obtained from the large-sparse-system analysis, when the average system load is equal to 1.9. This is due 
to finite L, W, K, and N: Substituting the BP threshold A^r?' ~ 1-98267 into /3 in the average system load (J25j, we find 
that the corresponding average system load is approximately equal to 1.93. The remaining gap 0.03 seems to be due to finite 



20 



IEEE TRANSACTIONS ON INFORMATION THEORY. VOL. , NO. , 2012 



1 [ 1 1 1 1 1 1 1 1 1 r 

large-sparse -system for W=6 

— I — A=2048, L=16, W=l, p injt =l<4 




1.5 1.55 1.6 1.65 1.7 1.75 1.8 1.85 1.9 1.95 2 
Average System Load 



Fig. 9. BER versus average system load for 1/<xH = 10 dB and r = 32. The number of iterations is equal to 1000. 



K and N. 

In order to achieve the improved BP threshold, L must tend to infinity. Let us discuss the effect of increasing L for finite 
K and N. If K and N were infinity, reliable information at the boundaries I' = and I' = L — 1 could propagate to the 
adjacent positions successfully. For finite K and N, however, it is probabilistic whether reliable information can propagate 
to the adjacent positions successfully. As L increases, thus, it becomes difficult for reliable information to propagate to the 
middle position successfully. Increasing K and N results in a reduction of the probability with which the propagation of 
reliable information to the adjacent positions fails. These arguments imply that the system size required for achieving a BER 
close to the asymptotic one increases as the average system load gets closer to the improved BP threshold. 



VII. Summary & Conclusions 

The SC-SCDMA system has been proposed to improve the performance of iterative MUD based on BP. We have derived the 
two iterative receivers, one based on exact BP, and the other on approximate BP with GA. The two BP receivers can achieve 
the same performance in the large-sparse-system limit. The continuum approximation of the DE equations for the two receivers 
implies that the BP threshold can be improved up to the IO threshold by spatial coupling. Numerical simulations imply that 
spatial coupling can provide a significant improvement in BER for a fixed finite-sized system especially in the region of high 
system loads, whereas a quite large system is required for approaching the IO threshold. 

We remark a capability of the phenomenological methodology for specifying the BP threshold for SC systems and a direction 
of future work to conclude this paper. The phenomenological result presented in Section UD is applicable to characterizing the 
BP threshold for any SC system, if DE equations for the corresponding uncoupled system are described by one parameter and 
if a continuum approximation of DE equations for the SC system is included in the phenomenological model |0. Of course, 
the phenomenological method is not applicable to all SC systems. A further generalization of the phenomenological model is 
left as a future work. 



Appendix A 
Derivation of (TTTb 

We shall show that (fTTT i is equivalent to the canonical system (O with A{u) = —B'(u)/2. For functions u(x, t) and v(x, t), 
we expand the energy functional H(u + ev), given by ( tT2i i. around e = to obtain 



OH 

H{u + ev) = H(u) + + 0(e 2 ), 



(54) 



with 



8H f 1 



V'(u) + 



j 2 B'(u) fdu 

dx 



vdx 



(55) 
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Since u(x, t) + ev(x, t) must satisfy the boundary conditions u(0, t) + ev(0, t) — u r and u(l,t) + ev(l, t) = u r around e = 0, 
we impose the boundary conditions u(0,t) = u(l,t) = u r and v(0,t) — v(l,t) = 0. Integrating by parts the last term in 
with the boundary condition v(0, t) = v(l, t) = yields 



§< u ) = |{n<0 + r 



B'{u) fduY 
2 (dx) 



d ( du 

This expression implies that the functional derivative of (fT2] > is given by 



5H 

Su 



(it) = V'(u) +7 2 



B'(u) (du 
dx 



v(x, t)dx. 



dx \ ^ ^ dx 



Substituting (|57] > into dTTb , we find that (fTTb is equal to the canonical system (O with A(u) = —B'(u)/2. 
We next show (fT~3T >- Differentiating (fT2l i yields 



dH(u) 
dt 



V 



YB 



YB' (du 

dx 



du 

m dx 



du d 2 u 
dx dtdx 



dx. 



Integrating by parts the last term, we obtain 

dH(u) 
dt 



f 




10 





B 1 (d 



dx 



— (b— 

dx \ dx 



du 



dt 



dx, 



(56) 



(57) 



(58) 



(59) 



where we have used du(0, t)/dt= du(l, t)/dt= 0, obtained from the boundary condition u(0, t) = u(l, t) = u r . Substituting 
© with A(u) = -B'{u)/2 into (g9) yields 



dH(u) 
dt 



1 'du" ' 
dt 



dx, 



(60) 



which is equal to ( fT3b . because of ( fTTT i. 



Appendix B 
Proof of Theorem[3] 



We start with proving the following lemma. 



Lemma 1. Suppose that A(u) — —B'(u)/2 and that u T is the metastable solution of the effective potential U(u). If there is 
a solution {{(1/2) £ (u\,u un ) to the fixed-point equation F{u{l/2)) = {2^)^ 1 , with 



F(x) 



dy 



L ^2{U{y)-U{u{\/2))Y 

then there is a non-uniform stationary solution u{x) to \15\ . Furthermore, the solution u(x) satisfies 

F(u(x)) — 7 _1 min(x, 1 — x). 

Proof of Lemma\l} We confirm that u(x) satisfying d62l l is a stationary solution to (15[ . It is obvious that the solution 
satisfies the boundary condition £t(0) = u(l) = u T . Let us show that the solution d62l > is a solution to (fT8] l. Differentiating 
with respect to x yields 

7 & f -y/Ufv) - U{u(l/2)) for x< 1/2 



(61) 



(62) 



i/2dx \ ^U{u) - U(u(l/2)) for x > 1/2, 



(63) 



which is equivalent to ( fT8l with the constant C = —U(u(l/2)). Furthermore, it is straightforward to show that the LHS of 
(l63l is differentiable with respect to x. These observations imply that the solution d62l is indeed a non-uniform stationary 
solution to (TT3T >. ■ 

Remark 7. The conservation of mechanical energy explains why u(X/2) must be between u\ and u un , as defined in Lemma\T] 
Let us assume that there is a solution u(x) to the Newton equation ( |i7| ) such that the solution is on the right maximum of the 
inverted potential —Uiu) at x — 0, and arrives at some position u{l/2) > u nn at x — 1/2. The velocity u'(l/2) at x = 1/2 
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must be zero, because of the differentiability of u{x) at x = 1/2. However, the definition of u un , i.e. U(u un ) — U{u r ) implies 
— U{u{\/2)) < —U(u T ), which breaks the conservation of mechanical energy: 

^-u'(l/2) 2 - t/(u(l/2)) < ^«'(0) 2 - U(u r ). (64) 

Thus, u(l/2) must be smaller than u un . 

The same argument explains that u(l/2) must be larger than u\. Let us assume m(1/2) < u\. Then, the conservation of 
mechanical energy implies that the velocity u'{l/2) at x = 1/2 must be non-zero, since ii\ is the global maximizer of the 
inverted potential —U(u). However, the non-zero velocity u'(l/2) contradicts the differentiability of u(x) at x = 1/2. Thus, 
u(l/2) must be larger than U\. From the arguments above, u{l/2) must be between u\ and u un . 

In order to prove the first part of Theorem |3] we show that F(u{\/2)) is bounded for all u(l/2) <G (ui,u un ), and that 
F(u(\/2)) tends to infinity as £t(l/2) — > u\ or as w(l/2) — > u un . Lemma [T] and these properties of F(u(l/2)) imply that 
there are two non-uniform stationary solutions to (IT31 for sufficiently small 7 > 0. 

We shall show the former property of F(u{l/2)). Let u$ denote a value between m(1/2) and the unstable solution of the 
effective potential U(u). Splitting the interval of integration (-0,(1/2) , u T ) into the two intervals (u(l/2), u ) and (w , u T ) yields 



F(fi(l/2)) = 



s( i/2) y/2{U(y) - U(u(l/2))} 
dy 



dy 

("(1/2))} 

(65) 



y/2{U(y)-U(u(l/2))} 

The condition u(l/2) < u un implies that the second term is bounded, because of U(y) > U(u(l/2)) for all y g [uo,it r ]. 
Thus, we focus on the first term. Let u denote an appropriately chosen value between u(l/2) and u$. From the mean value 
theorem, we obtain 

dy 

u ( i/2) ^/2{U(y)-U(u(l/2))} 
fi ° dy 
(1/2) y/2U'(u)(y-u(l/2)) 

1 r° dy 



< sup 



u6(u(l/2),8o) yJU'{u) Ju(l/2) y/2(y - 5(1/2)) 

= V2(5o- «(l/2)) sup J— , (66) 
«e(«(i/2),fi ) V U '{u) 

which is bounded, because of u(l/2) > u\. Thus, we find that F(u(l/2)) is bounded for all u(l/2) £ (ui,u un ). 

We next show the latter property of F(u(l/2)). The upper bound d66l l on the first term of (l65t diverges as u(l/2) — > u\, 
because of U'(u\) = 0. It is straightforward to show that the first term of (l65i tends to infinity as u(l/2) — > iii. On the 
other hand, the second term of (l65t diverges as u(l/2) — > u un , owing to U(u T ) = U(u un ). These observations imply that 
u(l/2) = ui and u(l/2) = u un are solutions to the fixed-point equation F(u(\/2)) = (2-f)~ 1 in the limit 7 — >■ 0. 

We have shown that there are two non-uniform stationary solutions u s (x) and u un (x) to (fT~5T > for sufficiently small 7 > 0, 
and that u s (l/2) and u un (l/2) tend to u\ and u un in the limit 7—^0, respectively. We next prove that u s (x) converges to u\ 
for x € (0, 1) in the limit 7—^0, and that u un (x) tends to u T for x 7^ 1/2 in the limit 7—^0. Since the stationary solutions 
are symmetric about the axis x = 1/2, without loss of generality, we focus on the interval [0,1/2). Differentiating d62l for 
x € [0, 1/2) with respect to x yields 

= 7 -S (67) 

y/WWti) ~ U(u(l/2))} 

where we have used ( RxO . For the stationary solution u(x) = u s (x), the denominator on the LHS of (|67] > should diverge as 
7^0. Since u\ is the global stable solution of the potential U(u), the stationary solution u s (x) tends to u(l/2) = u\ for 
x € [0, 1/2) in the limit 7 — > 0. Similarly, we find that u U n(x) converges to u r or u un for x £ [0, 1/2) in the limit 7 — > 0, 
because of U(u un ) — U(u T ). 

We shall prove the convergence of u un (x) toward u r for x £ [0,1/2) in the limit 7—^0. For that purpose, we show 
u U n{x) > u U n for x £ (1/2 — f, 1/2), with sufficiently small e > 0. Combining this property and u' un (x) < for x £ [0, 1/2), 
obtained from (l67l i. we find u un (x) > u un for x £ [0, 1/2). This implies that u un (x) tends to u r for x £ [0, 1/2) in the limit 
7^0. 

Let us prove u un (x) > u un for x £ (1/2 — e, 1/2), for sufficiently small e > 0. The mean value theorem implies 

<n(*) - <n(V2) + Kn(x) (x ~ (68) 
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for x G (1/2 — e, 1/2], where a; is a value between x and 1/2. Substituting v! UXL {l/2) — and (jT7j into (|68l l, we obtain 

4w-^(-j). (w 

This expression implies that ttu n (ir) is strictly negative for x E (1/2 — e, 1/2), because of U' (u un (x)) > 0. Thus, we find 
u U n{x) > u un (l/2) = ii un in the neighborhood (1/2 — e, 1/2). This observation corresponds to the physical fact that a free 
particle cannot continue to stay at a point on a smooth slope. 

We complete the proof of Theorem [3] by analyzing the stability of the non-uniform stationary solution u s (x) in the limit 
7 — > 0. Repeating the derivation of (fTTT l, we find that (Q3) can be represented as 

d -l = -B{ g -\u)) 6 -^(u), (70) 



with 

if(u 



dx. (71) 



Since -B(it) is positive, u s (x) is stable if u s (x) is a (local) minimizer of the energy functional (T7TT >. We shall prove 

lim H(uJx)) = U(ui), (72) 

which implies that u s (x) attains the global minimum of fi7T[ in the limit 7—^0, since u\ is the global stable solution of the 
potential U{u). Thus, the solution u s (x) is stable in the limit 7 — > 0. 
Let us prove d72l . Substituting d63l into dTTT l yields 

H{u s {x)) = 2 / U(u s (x))dx - U{u s {\/2)), (73) 



Since the integrand U(u s (x)) is obviously bounded for all x S [0, 1], the dominated convergence theorem implies 



lim H{u s (x)) = 2 / lim J7({t s (x))dx - = U(u{), (74) 

where we have used the fact that u s (x) converges to u\ for x € (0, 1) in the limit 7 — > 0. 

Appendix C 
Proof of Theorem[4] 

A. Repammetrization 

In order to prove Theorem |4] we evaluate the evolution of the tentative marginal posterior probability (f29) in the large- 
sparse-system limit, following [19|, ll4Tl . The marginal posterior probability is a random variable on the space of probability 
distributions, because of the randomness of y and G. Since we have assumed BPSK, the marginal posterior probability (|29T > can 
be represented with one parameter. Selecting the log likelihood ratio (LLR) as the parameter is suitable for proving Theorem |4] 
Evaluating the evolution of the tentative marginal posterior probability ((29) is equivalent to tracing the evolution of the pdf of 
the LLR. 

Let 1,9 ij^.j, denote the LLR for the message ( f30b provided from the function node (n, I) to the variable node (k, I') in 
iteration i, 

„=ln ffi ( **-" =1) , (75) 
{n , lW ) $(^,, = -1) 

Furthermore, we write the LLR for the message (|3ll propagating along the same edge in the opposite direction as 

r (0 , m Si(Pk,i> = 1) 

'""*' = k *^)' (76) 

The product step (l3TT l can be represented as follows: 

r(«) \ ' r(«) C77-V 

^(fe,i')^(n,i) " (n,[)^(fc,i')' k ^ 

(ft,l)ea(fc,i')\(n,J) 

The ACF property of the (r, i, M / )-ensemble presented in Example |3] guarantees that the incoming LLRs {Vf} - } are 

independent random variables in the large-system limit. Furthermore, the central limit theorem implies that the LLR (1771 ) 
converges in law to a Gaussian random variable in the large-sparse-system limit, i.e. in the dense limit after taking the large- 
system limit. Thus, it is sufficient to evaluate the mean and variance of the LLR ( fTTT l conditioned on the data symbols 
in the large-sparse-system limit. In the proof of Theorem 5] we always fix the data symbols and omit conditioning with respect 
to the data symbols. 
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B. Density Evolution 

1 ) Mean and Variance of fi77\) : In order to calculate the mean and variance of the LLR ( 177} , we first define and calculate 
several quantities. Let us define fj(y) and n {y) as 

fi (y) = ( v ~ In 2 Xl ' ) 9(y- in,i, k ,v ; <fl farj = o,i, (78) 



if (*/)= E 




^2 
/t2 



*ff(y-J£W;°2) II 

(fc,r')ea(n,0\(fc,e) 



(79) 



for j = 0, 1, 2. In (ED and {79), and J n ,J,fc,J' are given by (f32) and 

E S n,i,fc,r'^fc,i~' 
(k,l')ed{nd)\{k,l>) y (W + 1)0^11, 



l n ,l,k,l' 



(80) 



respectively. Furthermore, (7(2; a 2 ) denotes the pdf ([U for a zero-mean Gaussian random variable with variance a 2 . The two 
quantities (fTBl and ( |79l are used to Taylor-expand the RHS of (130b in the large-system limit. 



Lemma 2. Suppose that 



is picked up from the (r, L, W)-ensemble, presented in Example\3\ Then, 

1 



/_ 



30 ~ft l \v) 
oo ft 1] {y) 



fi(y)dy -> - 



of(i)' 



50 

»U w) (if), 



in f/ze large-sparse-system limit, where the overlines in H8U and ( 1821 ) represent the expectation with respect to 
RHSs of (1571) one/ f |52l , of (i) ant/ of (i) are respectively given by 



1 w n w + 1 



(i-i) 



2'=0 

ir 



w 



;'=o 



(i-i) 
(i-i')i 



with 



&!,,, - E[6g,] 



(81) 

(82) 
In the 

(83) 

(84) 

(85) 
(86) 



In ( 1&3I ) and ( l<S4l ), /3; = K/Ni is equal to /3; n it /or Z = 0, . . . , W — 1 ana! fo /3 /or I = W, . . . , L — 1, respectively. 

Proof of Lemma\2} As we have noted in Section lTV-BI the central limit theorem implies that the postulated interference i 
converges in law to a Gaussian random variable in the dense limit. The mean and variance of ( T32l are given by (l33l l and (l34l . 
respectively. Since d79l depends on {&rr,} only through the postulated interference (l3Zt . we calculate the marginalization in 



(|79l as the expectation with respect to the postulated interference to obtain 

ft 1 \y) = a{y-itjZ^ + &^ 



_ -(t-i) 

a l + V n,l,k,V 



(87) 
(88) 
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where , v and v n l \ k are given by ( T33T > and d34t . respectively. Substituting these expressions into (fSTb and d82l i and 
performing the integrations with respect to y, we have 

00 f^'^tv) 1 

r * h(y)dy-> 7—7^, (89) 

- ft%) <+~^, v 

rtJ /o(2/) ^ ■ 

in the large-sparse-system limit. 

In order to complete the proof, we take the expectation with respect to (l24l i. picked up from the (r, L, VF)-ensemble. 
The weak law of large numbers implies that the variance (l34t converges in probability to the second term on the RHS of 
in the large-sparse-system limit. Since the RHS of (|89l is bounded, this implies that ( TSTb holds. Next, the average of 



(In,l,k,v ~ fin l fc l') 2 over 'EH 1 is e q ual t0 



(*-i)i\2 



, , (6r r, - E[6 l - - 'l) 2 

-C*- 1 ) ^2 M fc -'' 

(in,w -m„, w - 2^ i w + 1)5 : - ' (91) 

(k,l')ed{n,l)\(k,l') l '~ k ' 1 ' 



which converges in probability to the second term on the RHS of (1831 1 in the large-sparse-system limit. This observation implies 
that <|82) holds. ■ 
We shall evaluate the mean and variance of the LLR (fTTT i in the large-sparse-system limit. For that purpose, we use Lemma |2] 
to calculate ( T75l > up to 0(r~ v ). Expanding the RHS of (f30b with respect to s n ,i,k,l'bk,i> / \J {W + l)ci,k,i' U P t0 me second 
order yields 

(i) fi \ 1)/ \ . t(i— 1)/ \ S n i k.l'bk,l' 



J2 (2/n,0 b n,Lk,l' u k,l' + 0( r - 3 / 2 ) (92) 
2 (W+lJcj^J/ 



where (y) is given by (f79b . Note that {&r, r, : (A;, £') € <9(ra, l)\(k, I')} in (f30b are dummy variables, so that we have replaced 
them by {&|*r,} to obtain (|92l . Substituting (|92l into (fTSt and expanding the obtained formula, we have 

r(>) _ fj \yn,l) 2Sn t l t k,l> n f„-3/2\ /nix 

Ww> - /o (i-D (yni) V(^ + l)c W ' + ( ^ ( } 

We note that no term proportional to appears under the BPSK assumption, whereas the term does for general data 
symbols |fl9l, BT1. 

In order to calculate the mean and variance of (193V we use the following expansion: 



P Vn,l 



s n ,i,k,i'bk,i' , T 

+ ln.Lk.l' 



VW + i)c w , 



-f (y n ,i) + fi(y n j) f r :: l ;Y^- = + w 



Evaluating the mean of ( |93l > with this expression, we obtain 

-(0 1 

J (n,Z)^(M') J 



-°°/o ( 1} (y n ,i) (W + l)c ltk , v 

b k ^+0(r- 3 / 2 ), (95) 



(W + l)ci,k,i'df (t) 

where we have used (f8TT > in the derivation of the last equality. Similarly, we use ( 1821 to calculate the variance of d93l as 

-W ! _ 4gf(t) 

(W + l)ci, k ,pdf{i) 

The two expressions ( |95T l and ( |96l l imply that the mean and variance of the LLR ( TTTb converge to 



J [4V)-(»,0^ 2sir ?^'> (97) 
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V[L 



W 1 

(fc,i')->(n,0 J 



4sir|, i) , 



in the large-sparse-system limit, respectively, with 



i (j 

— E- 

+ 1 ^ a 



4 ~ (i) ' 



sir,, = 



w 



1 w 1 



where of (i) and 5f(i) are given by (l83l and d84l l. respectively. In the derivation of 
that (124-b is picked up from the (r, L, W / )-ensemble presented in Example [3] 

In summary, the LLR ( ITTl i converges in law to a Gaussian random variable with mean d97| l and variance 
large-sparse-system limit. 



(98) 

(99) 
(100) 

and ( 1 100b . we have used the assumption 

in the 



2) Decoupling: We shall present the asymptotic expression of the equivalent channel 



(i) 

Let z k denote a Gaussian 

random variable with mean bk,v and variance sir;/' /(sir;, ) 2 . The LLR of the tentative marginal posterior probability 

statistically equivalent to the LLR d77l > or 2sir;, z^\, in the large-sparse-system limit. Since the pmf p(x) for the BPSK random 
variable x E {1, —1} is proportional to exp(Lx/2), with L = \n{p(x = l)/p(x — —1)}, we obtain 



cE 



exp sir ; , «L ,&L j» 



(i) 



■ (») ' 

sir;, 
(sir z ' ) 2 , 



dz 



k,l" 



(101) 



in the large-sparse-system limit, where the posterior probability p(b k % \zu,, 



is given by 



with 



P\ u k,l'\ z k,l' I ~ 77. TTSfi) ■ 

p{z% r -M?7)- x ) 



P (zg,;(sir;;V 1 )= E ^S-sBmC^)" 1 )- 



In order to prove that the equivalent channel (1101b is equal to d42l . we show that sir^ is given by (03J and that sir;, ' is 



(102) 



(103) 



the large-sparse-system limit, the MSE (l85i and the posterior variance (1861 1 are equal to 



equal to sir}, . Since the LLR of the tentative marginal posterior probability d29l ) is statistically equivalent to the LLR d77l ) in 

(104) 
(105) 



# = E 



= E 



where the posterior mean 



is given by 



(i) 



fc,Z' 



E ©K©i*So- 



(106) 



Expressions 

,(0), 



, (1100b . ( 1104b . and (1105b provide DE equations with respect to sir j, 1 ' and sir J, ' . The initial condition 



m^(6fc,r) = 1/2 implies 4° } = ^ = L because of E[b[ 

Let us prove that the DE equations reduce to those presented in Theorem |4]by induction. When i = 1, ( T83l > and (l84l implies 



(0) 



k(o)- 

fc,Z' 



of (1) = erf (1), given by d44l with i = 1. Furthemiore, sir|, = sir), given by (l43l . holds from (l99l and (1100b . Next, suppose 



that the statement holds for iteration i. Since sir,, = sir;,', the posterior probability (1102b is equal to that for the scalar AWGN 
channel (f3Tb . Thus, the MSE ( 11041 ) and the posterior variance (1105b coincide with each other for iteration i. This observation 



implies sir[, i+1 ' ) = sir;,""" given by (l43l for iteration i + 
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Appendix D 
Proof of Theorem[7] 

We first prove the following lemma to show Theorem [7] The lemma is obtained via the continuum approximation of the 
DE equations d43l ) and d44b . 

Lemma 3. Suppose that there exists a smooth function s(x,t) that satisfies s(l' /L,iAt) = sirj, with small At > for all 
I' £ C in the limit L, W -+ oo with 7 = W/L -> 0. For x £ (0, 1), 



1 



■7 2 A(s) 



ds 
dx 



+7 2 #fe?)+0(7 3 ,A^ 2 ), (107) 



. ds 1 

dx \ dx 

in the limit L, W — > 00 with 7 = W/L — > 0. In ( 17071 ), B(s) and A(s) are given by ( 152 D and 

12{ct 2 + /3£(s)} J 

respectively, where £(s) denotes the MSE for the posterior mean estimator of the BPSK data symbol transmitted through the 
scalar AWGN channel with SNR s. 

Proof of Lemma\3\ We focus on the positions distant from the boundaries, i.e. W < I' < L — 1 — W. In these positions, 
(/ + l') L =1 + 1', A+c = P, and (1 + 1'- V) L = 1 + 1' -V hold for all I, V = 0, . . . , W. Expanding tf ^ = ^{saf x _ v ) = 
£(s((l' + I — l')/L,t)), with t = iAt, up to the second order around x = I'/L, we obtain 

dsl-V ld 2 s fl-l'Y 



Substituting (1109b into (04) yields 



n W+l ^ L 



=0 



2 



nw&' + r(8./fciiJM| +0(7 3 } . (110) 



Evaluating the inverse of (11 10b with 



2(W + 1) 

l'=0 



' ' h , x+ ( h '-l.\ x - + 0{x i ) 1 (!!!) 



a + 6a; + ca; 2 a a 2 



as x — >■ 0, we obtain 



1 _ 1 ft'ds/dx 1 y^I-V 

a 2 +l {i + 1) ~ ^T^C " K + /^) 2 ^+1 ^ ~T~ 

/3(Cd 2 s/dx 2 + t"(ds/dx) 2 ) 1 ^ - r\ 2 
2((T 2 + /?o 2 w + 1 ^ ^ £ y 

(i3Cds/dx) 2 I 1 ^ 



(a 2 +/3^ \W 

■ l'=Q 



L_f +0(7 »,. 



(112) 
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Substituting this expression into ((43) yields 



s(x,t + At) 



1 



(PCds/dx) 



with 



m'd 2 s/dx* + z"(ds/dxY) 3 
~° 2 + 0{1 } ' 



1 w ( 1 w ~,\ 
mw + i) ^ wTi ^ ~ ~ V) 

'=o \ i'=a J 



— >• 



12 : 



Co 



(113) 



(114) 



(115) 



in the limit L, oo with 7^0. Applying s(x, t + At) = s(x, t) + Atds/dt + 0(At 2 ) to (11131 , we arrive at (|1071 i. ■ 

We shall derive Theorem|7]from Lemma|3] In order to transform (1107b into the canonical system (0 with A{u) = —B'(u)/2, 
we calculate (O with ( 1108b and ( l52l to obtain 



D( S ) 



^irlb ds = lnK+ms)} + ^ n6 ' 



where we have selected (ln6)/2 as the constant of integration. Let us define /(s) as 

/(«) = y"ocp|i5(») + ilnB(8)Jdfl, 



which reduces to 



ds. 



(116) 



(117) 



(118) 



As mentioned in Section [III letting u = f(s) transforms (1107b into 



M du _ f&{u) fdu 

dt y J 2 



dx 



d 



dx 



du 



+TirAB{u)-\+0{ 1 \At 



dx 



with B(u) = B(f 1 (m)). In ( II 19b . the derivative of the potential V(u) is given by 

V(u) = f(s) ' 



(119) 



(120) 



with s = f 1 (u). 

In order to complete the proof of Theorem |7] we evaluate the potential V(u). Integrating ( 1120b by the transformation of 
variable u = f(s), we obtain 



V(u) = / f{s) 



1 



0* + ft(8) 



ds 



+#(«)}- / M'(s)ds 



_1 
~2 

where we have used ( 11 18b . Integrating the second term by parts yields 

1 



V(u) = - [ln{a 2 n + ft(s)} - Ps((s) + 2/3C7( S ) + A] , 



(121) 



(122) 



with a constant A. In the derivation of (1122b . we have used (|47b . Selecting A = — In a 2 , we find that the potential ( 1122b is 
equal to the free energy given by d48b . 
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